ABSTRACT



This dissertation reports research about the phase space perspective for solving
wave problems, with particular emphasis on the phenomenon of mode conversion in
multicomponent wave systems, and the mathematics which underlie the phase space
perspective. Part I of this dissertation gives a review of the phase space theory of
resonant mode conversion. We describe how the WKB approximation is related to
geometrical structures in phase space, and how in particular ray-tracing algorithms
can be used to construct the WKB solution. We further review how to analyze the
phenomenon of mode conversion from the phase space perspective. By making an
expansion of the dispersion matrix about the mode conversion point in phase space,
a local coupled wave equation is obtained. The solution of this local problem then
provides a way to asymptotically match the WKB solutions on either side of the
mode conversion region. We describe this theory in the context of a pedagogical
example: that of a pair of coupled harmonic oscillators undergoing resonant
conversion. Lastly, we present new higher order corrections to the local solution for
the mode conversion problem which allow better asymptotic matching to the WKB
solutions. The phase space tools used in Part I rely on the Weyl symbol calculus,
which gives a way to relate operators to functions on phase space. In Part II of
this dissertation, we explore the mathematical foundations of the theory of symbols.
We first review the theory of representations of groups, and the concept of a
group Fourier transform. The Fourier transform for commutative groups gives the
ordinary transform, while the Fourier transform for non-commutative groups relates
operators to functions on the group. We go on to present the group theoretical
formulation of symbols, as developed recently by Zobin. This defines the symbol of
an operator in terms of a double Fourier transform on a non-commutative group.
We give examples of this new type of symbol, using the discrete Heisenberg-Weyl
group to construct the symbol of a matrix. We then go on to show how the path
integral arises when calculating the symbol of a function of an operator. We also
show how the phase space and configuration space path integrals arise when
considering reductions of the regular representation of the Heisenberg-Weyl group to the
primary representations and irreducible representations, respectively. We also show
how the path integral can be interpreted as a Fourier transform on the space of
measures, opening up the possibility of using tools from statistical mechanics (such
as maximum entropy techniques) to analyze the path integral. We conclude with a
survey of ideas for future research and describe several potential applications of this
group theoretical perspective to problems in mode conversion.

TOPICS IN MODE CONVERSION THEORY AND THE GROUP
THEORETICAL FOUNDATIONS OF PATH INTEGRALS



Part I 

The Phase Space Theory of 
Resonant Mode Conversion 






CHAPTER 1 



Introduction 

1.1 Introduction to Part I 

In the first part of this dissertation, we describe the phase space theory of
mode conversion. Chapter 2 introduces the phase space point of view for solving
generic wave equations. We will then review the WKB method for construction of
approximate solutions, and show how WKB methods are connected to phase-space
ray-tracing algorithms. Then, in Chapter 3, the example of two coupled oscillators
will be given. If the natural frequencies of the oscillators are time dependent, and
cross at some time, then this problem can be recast into a form which is
mathematically very similar to a mode conversion problem. The complete description of
the solution of this coupled oscillator problem provides a pedagogical introduction
to the phase space techniques used in the theory of mode conversion.

In Chapter 4, we will apply these tools to a standard avoided crossing mode
conversion. Usually, the solution of such a problem involves the linearization of
the dispersion function about the mode conversion point. A local solution is then
constructed so that incoming and outgoing WKB solutions can be asymptotically
matched, allowing the problem to be treated as a sort of "ray splitting". We will
analyze the effects of the next higher order terms, and show how to construct a
new local solution which takes these quadratic terms into account. By including
the effects of the quadratic order terms, the region in which the matching can be
performed will be enlarged substantially.

1.2 Introduction to Part II 

The phase space theory described in Part I of this dissertation makes extensive
use of the theory of symbols of operators. In Part II, we describe the mathematical
foundations of the theory of symbols. Because these mathematical foundations rely
heavily on the theory of representations of groups, we will first give a review of group
theory in Chapter 7. The example of the Heisenberg-Weyl group will be reviewed in
detail, and we will show how the relationship between phase space and configuration
space arises from the reduction of the regular representation of this group.

Then, in Chapter 8, we will describe how the symbol of an operator can be
calculated by a double Fourier transform. The operator is first embedded into a 
section of the dual bundle, which is like an operator-valued "function" on the set 
of irreducible representations of a non-commutative group. The non-commutative 
Fourier transform is then applied to convert this section into a function on the group, 
where the group is considered as a set. Then, using a commutative group structure 
on this same set, we perform another Fourier transform. The result is an ordinary 
complex-valued function, which is the Zobin symbol of the operator we started with. 

Using this definition of the symbol, as developed recently by Zobin, we will
calculate the symbol of a function of an operator in Chapter 9. In particular, we will
consider the exponential of an operator, defined using a power series. The symbol
of the $N$th power of an operator will be computed by using the $N$-fold star product
of the symbol of the operator. In the limit of large $N$, this repeated application of
the star product can be written as a path integral, where the paths live in the dual
to the commutative group. We then illustrate this general theory of path integrals
by explicitly calculating the repeated star product for the discrete Heisenberg-Weyl
group. This group has irreducible representations which are $n \times n$ matrices, so the
calculation presented can be used to calculate functions of a matrix. In particular,
the exponential of a matrix will lead to a discrete "path integral", which by grouping
similar paths can be written in terms of a multiplicity function. This will lead to
the consideration of the connections between path integrals and statistical mechanics.
In addition, the multiplicity function gives rise to a probability distribution, or
measure, on the space of all possible paths, which will allow the path integral to
be interpreted as a Fourier transform on the space of measures. For the continuous
Heisenberg-Weyl group this becomes an infinite-dimensional Fourier transform.
Considering the path integral for the Heisenberg-Weyl group will also show which
aspect of group theory underlies the connection between the phase-space path integral
and the configuration-space path integral. Specifically, the reduction of the regular
representation to the primary representations leads to consideration of functions on
phase space. This leads to the phase-space path integral. Further reduction of the
primary representations to irreducible representations involves functions on
configuration space. This reduces the phase-space path integral to the configuration-space
path integral.

The new group-theoretical approach to path integrals which will be described
in Chapter 9 has many potential applications. In Chapter 10 we will outline several
ways in which this new point of view could be applied to mode conversion theory. This
chapter will point out several avenues of current and future research.

We will first see how consideration of the star product formulation of the path
integral may lead to using the diagonals of the dispersion matrix as ray hamiltonians
for constructing WKB solutions to vector wave problems. This requires the definition
of a new "normal form" for the dispersion matrix, where the symbols of the
diagonal elements Poisson commute with the symbols of the off-diagonal elements.
In addition to simplifying ray-tracing algorithms, this could also help provide physical
insight for vector wave problems with non-standard mode conversion geometries.

A second avenue of research based on our new group theory perspective involves 
calculating a "double" symbol for vector wave problems. The wave operator for 
vector wave problems can be written as a matrix of (pseudo) differential operators. 
The ordinary symbol of each element of the matrix can be calculated, giving the 
dispersion matrix as a function of phase space. However, as we will see in Chapter 8,
it is possible to calculate the "symbol" of a matrix. So we can calculate the discrete 
"symbol" of the dispersion matrix at each point in phase space. This will give a 
new "double symbol" of the wave operator, which is a function of several discrete 
variables in addition to being a function of the phase space variables. 

As a final potential topic of further research, we will discuss in Chapter 10 the
possibility of using an averaged Wigner function to model the effects of turbulent
plasma fluctuations on mode conversion. This approach is based on the connection
between the Wigner function and the density matrix in quantum mechanics. In
quantum mechanics, mixed state density matrices are used to model the decoherence
of a quantum state due to interaction with the environment. This decoherence
makes the Wigner function for a mixed state look like a more classical probability
distribution on phase space. We will suggest that a "mixed state" Wigner function
could be used to describe mode-converting waves in a turbulent plasma. Preliminary
calculations suggest that this would make the Wigner function appear more
"classical", with amplitude confined to regions near the dispersion curves for the
various wave modes.



CHAPTER 2 



A Brief Introduction to Phase Space
Methods for Wave Equations



From simple waves on a string, to the wave-functions of quantum particles,
to the tensor waves of relativistic gravity wave theory, waves and their behavior
influence almost all physical systems. In this work, the phenomenon of mode
conversion is studied from the phase space perspective. Mode conversion occurs in
multicomponent wave problems, when variations in the background medium allow
local resonances between different types of waves. The resonances lead to energy
transfer between the modes, which must be considered in order to understand the
behavior of the system. The phase space for a wave problem is composed of physical
space and "wave-vector" space, just as classical phase space is composed of the
position and momentum of a particle.

A generic wave equation can be written as an operator acting on a field. The 
operator can vary in space and time, and the field may be multicomponent: 

$$ \hat{\mathbf{D}}(x, -i\partial_x; t, i\partial_t) \cdot \boldsymbol{\psi}(x,t) = 0. \tag{2.1} $$



This equation, together with fitting to appropriate initial and boundary conditions,
defines a generic wave problem. Analysis of such a general wave equation is difficult,
and in many practical situations, a general solution may not be useful. Many
methods have been developed which give approximate solutions for cases of physical
interest. Fourier methods are powerful for time-independent wave problems in a
uniform medium. Spectral techniques are powerful for analyzing stable configurations
of quantum systems. The asymptotic theory of Wentzel, Kramers, and Brillouin
connects solutions of wave problems (involving partial differential equations) to the
solution of classical mechanics problems (which involve ordinary differential
equations). This so-called WKB theory, or WKB method, brings together ideas from
Hamiltonian mechanics, asymptotic wave theory, the mathematical theory of
operators, and the theory of path integrals. Further extension of WKB theory in
the 1960s by Maslov and Arnold showed that underlying the theory are geometric
structures in wave phase space. These objects, called Lagrange manifolds, determine
the phase fronts and amplitude variation of asymptotic solutions of the wave
equation. While originally developed for solving the scalar Schrödinger equation of
quantum mechanics, the WKB method can also be employed to solve multicomponent
problems, such as those which arise when solving Maxwell's equations in a
plasma. There is a large literature which shows that a ray-tracing approach based
on the WKB method can be used to obtain asymptotic solutions to mode conver- 
sion problems, both in one spatial dimension l2, y, and in multiple spatial 
dimensions p, [g, [Zl, [sl, [9 1 . 

In the following chapters, I first describe and illustrate this phase space theory
with an analogy of a resonance crossing problem from classical mechanics: the
problem of two coupled harmonic oscillators with time-varying natural frequencies
(Chapter 3). This serves to introduce basic notation and concepts we will need in
later chapters. Because of the introductory nature of this material, we will use a
heuristic, one-dimensional approach to this problem, and ignore the effects of
caustics. In addition, we will assume that all operators are self-adjoint. The intuition
gained from this pedagogic introduction is then applied in Chapter 4 to the phase
space theory of waves, and is used to study higher order corrections to the solution of
the mode conversion problem.



2.1 Phase space theory for scalar waves 

The idea of using a "phase space" to analyze a wave problem can be introduced
using the example of a one dimensional, dispersive, scalar wave equation. Take, for
example, the Schrödinger equation for a free particle in one dimension,

$$ i\hbar\,\frac{\partial\psi}{\partial t} = -\frac{\hbar^2}{2m}\,\frac{\partial^2\psi}{\partial x^2}. \tag{2.2} $$

This equation has the form of an operator acting on a function, and the solutions
are eigenfunctions with eigenvalue zero. The wave operator is

$$ \hat{D} = i\hbar\,\partial_t + \frac{\hbar^2}{2m}\,\partial_x^2. \tag{2.3} $$

Solutions of the Schrödinger equation are plane waves, characterized by a frequency
$\omega$ and wavenumber $k$. Acting on the plane wave with the wave operator gives a
relationship between frequency and wavenumber which must be satisfied in order
for the wave to be a solution:

$$ \hat{D}\,e^{i(kx-\omega t)} = \left(\hbar\omega - \frac{\hbar^2 k^2}{2m}\right) e^{i(kx-\omega t)} = 0. \tag{2.4} $$

So, solutions must have

$$ D(k,\omega) = \hbar\omega - \frac{\hbar^2 k^2}{2m} = 0, \tag{2.5} $$

where the function $D(k,\omega)$ is called the dispersion function. Solving this equation
for $\omega$ gives us the dispersion relation

$$ \omega = \frac{\hbar k^2}{2m}. \tag{2.6} $$

Comparing the operator (2.3) with the dispersion function (2.5) shows that they
are equivalent if we can make a correspondence between operators and functions of
$(\omega, k)$:

$$ \omega \to i\partial_t, \qquad k \to -i\partial_x. \tag{2.7} $$
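
As a hedged numerical illustration of this correspondence, the following Python sketch applies the free-particle wave operator to a plane wave by finite differences and compares the result with the dispersion function. The units ($\hbar = m = 1$), the step size, and all function names are assumptions made for this example only; they do not come from the text.

```python
import cmath

HBAR = 1.0  # assumption: units with hbar = 1
MASS = 1.0  # assumption: unit particle mass


def dispersion_function(k, omega):
    """Symbol D(k, omega) = hbar*omega - (hbar*k)^2 / (2m), as in Eq. (2.5)."""
    return HBAR * omega - (HBAR * k) ** 2 / (2.0 * MASS)


def plane_wave(x, t, k, omega):
    """Plane wave exp(i(kx - omega*t))."""
    return cmath.exp(1j * (k * x - omega * t))


def apply_wave_operator(psi, x, t, h=1e-3):
    """Apply i*hbar*d/dt + (hbar^2/2m)*d^2/dx^2 using central differences."""
    dpsi_dt = (psi(x, t + h) - psi(x, t - h)) / (2.0 * h)
    d2psi_dx2 = (psi(x + h, t) - 2.0 * psi(x, t) + psi(x - h, t)) / h**2
    return 1j * HBAR * dpsi_dt + HBAR**2 / (2.0 * MASS) * d2psi_dx2


def symbol_from_plane_wave(k, omega, x=0.3, t=0.7):
    """Ratio (D_hat psi) / psi for a plane wave; it should approximate D(k, omega)."""
    def psi(xx, tt):
        return plane_wave(xx, tt, k, omega)
    return apply_wave_operator(psi, x, t) / psi(x, t)
```

For any pair $(k, \omega)$ the ratio returned by `symbol_from_plane_wave` should agree with `dispersion_function` up to discretization error, and it should vanish when $\omega = \hbar k^2 / 2m$.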

This correspondence is familiar from the semiclassical theory of quantum mechanics,
but also forms the basis for analyzing generic wave equations. While studying
the equations of quantum mechanics, Wigner, Weyl and others developed a way to
relate a generic operator with a function in such a way that the function contains
all information about the original operator. In the context of this theory, the
dispersion function (2.5) is called the symbol of the wave operator (2.3). A generic
dispersive wave equation, even one with non-constant coefficients, can be analyzed
using symbol theory. If we write the generic equation as

$$ \hat{D}(x, -i\partial_x; t, i\partial_t)\,\psi(x,t) = 0, \tag{2.8} $$

then the symbol of the wave operator is the dispersion function

$$ D(x, k; t, \omega). \tag{2.9} $$

As in Equation (2.5) above, eikonal solutions of the wave equation must have the
relationship between $x$, $k$, $\omega$, and $t$ given by

$$ D(x, k; t, \omega) = 0. \tag{2.10} $$

This defines a three dimensional dispersion surface in the four-dimensional wave
phase space $(x, k, t, \omega)$.

In many problems that we will consider, the wave equation may be time
independent. This means that the $t$ dependence drops out of the dispersion function, and
we can use the Fourier transform to separate frequency components. The frequency
is then a parameter, and the dispersion function is a function on the two-dimensional
phase space $(x, k)$. The dispersion surface is then a one-dimensional curve in phase
space given by finding roots of

$$ D(x, k; \omega) = 0. \tag{2.11} $$

If we solve for $\omega$, this gives $\omega = \Omega(x, k)$, which is the local dispersion relation.

If the dispersion function depends smoothly on the phase space variables,
then it makes sense to try using a Taylor series expansion of the dispersion
function to get an approximate wave equation. We will always assume that
this expansion is possible. Such an expansion will in fact give an approximate
equation for the envelope of the wave when the carrier is given by the wavenumber
and frequency about which we are expanding:

$$ D(x, k; \omega) = D(x_0, k_0; \omega_0) + \frac{\partial D}{\partial x}(x - x_0) + \frac{\partial D}{\partial k}(k - k_0) + \frac{\partial D}{\partial \omega}(\omega - \omega_0) + \ldots, \tag{2.12} $$

where all the derivatives are evaluated at $x_0$, $k_0$, and $\omega_0$. If we consider problems in
a uniform medium, the $x$ derivative of the dispersion function is zero. The rule for
associating operators and functions (Weyl prescription) automatically symmetrizes
terms involving products of $x$ and $k$ (for more details see Chapter 8). In this case,
since we have a uniform, time independent medium, the terms $\partial_k D$ and $\partial_\omega D$ are
independent of $x$ and $t$, respectively, so we do not need to symmetrize. Using the
correspondence in Equation (2.7) to turn this back into an operator gives the wave
equation

$$ \left[ D(x_0, k_0; \omega_0) + \frac{\partial D}{\partial k}(-i\partial_x - k_0) + \frac{\partial D}{\partial \omega}(i\partial_t - \omega_0) \right] \psi(x,t) = 0. \tag{2.13} $$

Setting the first term to zero gives the dispersion function for the carrier of our
solution. Inserting this back, and writing the field as the carrier $e^{i(k_0 x - \omega_0 t)}$ times an
envelope (again denoted $\psi(x,t)$), gives an equation for the envelope:

$$ 0 = \left( \frac{\partial D}{\partial k}(-i\partial_x - k_0) + \frac{\partial D}{\partial \omega}(i\partial_t - \omega_0) \right) e^{i(k_0 x - \omega_0 t)}\,\psi(x,t) \tag{2.14} $$

$$ \phantom{0} = i\,e^{i(k_0 x - \omega_0 t)} \left( \frac{\partial D}{\partial \omega}\,\partial_t + \left( -\frac{\partial D}{\partial k} \right) \partial_x \right) \psi(x,t). \tag{2.15} $$

This now looks like an advection equation, with the characteristic velocity

$$ v = -\frac{\partial D/\partial k}{\partial D/\partial \omega}. \tag{2.16} $$

This is an alternative form of the familiar group velocity $v_g = \partial\Omega/\partial k$, where $\Omega(k)$ is the
dispersion relation. We prefer to work with the dispersion function $D(x, k; \omega)$ rather
than the dispersion relation because it is easier to interpret geometrically in higher
dimensions. The characteristic curves for Equation (2.15) are given in parametric
form by

$$ \frac{dt}{d\sigma} = \frac{\partial D}{\partial \omega}, \qquad \frac{dx}{d\sigma} = -\frac{\partial D}{\partial k}. \tag{2.17} $$
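
The equivalence between the characteristic velocity and the group velocity can be checked numerically. The sketch below is only an illustration: it assumes the free-particle dispersion function of Equation (2.5) with $\hbar = m = 1$ by default, and the helper names are hypothetical.

```python
def dispersion_function(k, omega, hbar=1.0, mass=1.0):
    """Free-particle dispersion function D(k, omega) of Eq. (2.5)."""
    return hbar * omega - (hbar * k) ** 2 / (2.0 * mass)


def characteristic_velocity(D, k, omega, h=1e-5):
    """v = -(dD/dk) / (dD/domega), with central-difference derivatives."""
    dD_dk = (D(k + h, omega) - D(k - h, omega)) / (2.0 * h)
    dD_domega = (D(k, omega + h) - D(k, omega - h)) / (2.0 * h)
    return -dD_dk / dD_domega


def group_velocity_from_relation(k, hbar=1.0, mass=1.0):
    """d(Omega)/dk for the dispersion relation Omega(k) = hbar*k^2 / (2m)."""
    return hbar * k / mass
```

Evaluating `characteristic_velocity(dispersion_function, k, omega)` at a point on the dispersion curve should reproduce `group_velocity_from_relation(k)` to within rounding error.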

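If we now consider the case where the background is weakly nonuniform, then the
solution will locally look like a plane wave, but with the wavenumber slowly changing.
This means that we can approximate the solution as

$$ \psi(x,t) = A(x,t)\,e^{i(\theta(x) - \omega_0 t)}, \tag{2.18} $$

where the envelope function $A(x,t)$ varies slowly compared to the phase. Acting on
this with $-i\partial_x$ brings down a derivative of the phase,

$$ -i\partial_x \psi(x,t) = -i\partial_x A(x)\,e^{i(\theta(x) - \omega_0 t)} = e^{i(\theta(x) - \omega_0 t)} \left( -i\partial_x A(x) + A(x)\,\partial_x\theta(x) \right). \tag{2.19} $$

We can define the function

$$ k(x) = \partial_x \theta(x), \tag{2.20} $$

and then expand the dispersion function about $(x, k(x))$ to obtain

$$ 0 \;\overset{\text{symmetrize}}{\approx}\; \left[ D(x, k(x); \omega_0) + \frac{\partial D}{\partial k}(-i\partial_x - k(x)) + \frac{\partial D}{\partial \omega}(i\partial_t - \omega_0) \right] \psi(x,t) \tag{2.21} $$

$$ \phantom{0} \;\overset{\text{symmetrize}}{=}\; e^{i(\theta(x) - \omega_0 t)} \left( D(x, k(x); \omega_0) + \frac{\partial D}{\partial k}(-i\partial_x) + \frac{\partial D}{\partial \omega}(i\partial_t) \right) A(x,t). \tag{2.22} $$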

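If the spatial gradient of $D$ is small, then, as before, we can write the derivative
terms as a total derivative along some curve. Assuming a time-independent medium,
we can write

$$ \frac{\partial D}{\partial k}(-i\partial_x) + \frac{\partial D}{\partial \omega}(i\partial_t) \;\overset{\text{symmetrize}}{=}\; i(\partial_\omega D)\,\partial_t - \frac{i}{2}\left( \partial_x\partial_k D + 2(\partial_k D)\,\partial_x \right) \tag{2.23} $$

$$ \approx i(\partial_\omega D)\,\partial_t - i(\partial_k D)\,\partial_x. \tag{2.24} $$

Here we have assumed that $\partial_x\partial_k D$ is small compared to the other terms. In the
next section we will consider the effects of this term, but for now we have again
obtained an advective derivative. Equation (2.17) still gives the parametrization of
the characteristic curves for this advective derivative. We now need to ensure that
the first term in Equation (2.22) remains zero along these characteristic curves:

$$ 0 = \frac{dD}{d\sigma} = \frac{\partial D}{\partial x}\frac{dx}{d\sigma} + \frac{\partial D}{\partial k}\frac{dk}{d\sigma} \tag{2.25} $$

$$ \phantom{0} = \frac{\partial D}{\partial x}\left( -\frac{\partial D}{\partial k} \right) + \frac{\partial D}{\partial k}\frac{dk}{d\sigma}. \tag{2.26} $$

This implies that the function $k(x(\sigma))$ can be found as a function of $\sigma$ by solving

$$ \frac{dk}{d\sigma} = \frac{\partial D}{\partial x}. \tag{2.27} $$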
Notice that this equation, together with the derivative of $x$ from Equation (2.17),
has the form of Hamilton's equations, where $-D(x, k; \omega_0)$ acts as the hamiltonian
function. Solving for $(x, k)$ from these equations, with a family of initial conditions,
gives us a surface in phase space called the Lagrange manifold. Integrating $k$ along
this surface allows us to find the function $\theta(x)$, which can be used to find the phase
of the solution as given in Equation (2.18). For a $2n$-dimensional phase space,
the dispersion surface defined by $D(x, k) = 0$ is $(2n - 1)$-dimensional. Embedded
in this dispersion surface is the Lagrange manifold, which is $n$-dimensional, since
the dimension of the Lagrange manifold is equal to the dimension of configuration
space, i.e., half the dimension of phase space. Each ray is only one-dimensional,
and so in general a family of rays is required in order to trace out the Lagrange
manifold. Notice that in problems with only one spatial dimension, these geometric
structures collapse. Phase space is now only two-dimensional, and the dispersion
surface, Lagrange manifold, and rays are all one-dimensional, and lie on the same
curve in phase space.
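
The ray equations (2.17) and (2.27) can be integrated numerically for any smooth dispersion function. The following is a minimal sketch, assuming a fourth-order Runge-Kutta integrator, finite-difference derivatives, and an illustrative dispersion function $D = \omega_0 - k^2/2 - x^2/2$ that is not taken from the text. In one spatial dimension the ray should remain on the curve $D = 0$, which gives a simple consistency check.

```python
import math


def example_dispersion(x, k, omega0=1.0):
    """Illustrative dispersion function (an assumption of this sketch)."""
    return omega0 - 0.5 * k**2 - 0.5 * x**2


def initial_wavenumber(x0, omega0=1.0):
    """Positive root k0 of example_dispersion(x0, k0) = 0."""
    return math.sqrt(2.0 * omega0 - x0**2)


def ray_rhs(D, x, k, h=1e-6):
    """Ray equations with -D as hamiltonian: dx/ds = -dD/dk, dk/ds = dD/dx."""
    dD_dx = (D(x + h, k) - D(x - h, k)) / (2.0 * h)
    dD_dk = (D(x, k + h) - D(x, k - h)) / (2.0 * h)
    return -dD_dk, dD_dx


def trace_ray(D, x0, k0, dsigma, n_steps):
    """Integrate one ray with classical RK4; returns the list of (x, k) points."""
    x, k = x0, k0
    ray = [(x, k)]
    for _ in range(n_steps):
        a1, b1 = ray_rhs(D, x, k)
        a2, b2 = ray_rhs(D, x + 0.5 * dsigma * a1, k + 0.5 * dsigma * b1)
        a3, b3 = ray_rhs(D, x + 0.5 * dsigma * a2, k + 0.5 * dsigma * b2)
        a4, b4 = ray_rhs(D, x + dsigma * a3, k + dsigma * b3)
        x += dsigma * (a1 + 2.0 * a2 + 2.0 * a3 + a4) / 6.0
        k += dsigma * (b1 + 2.0 * b2 + 2.0 * b3 + b4) / 6.0
        ray.append((x, k))
    return ray
```

For this particular choice of $D$ the ray equations reduce to $dx/d\sigma = k$, $dk/d\sigma = -x$, so the ray is a circle of radius $\sqrt{2\omega_0}$ in the $(x, k)$ plane and the single ray traces out the whole dispersion curve.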

The minus sign appearing in the ray equations when using the dispersion
function $D(x, k; \omega_0)$ as the ray hamiltonian may seem somewhat unusual. This is because
we are using the dispersion function $D$ to generate the rays, instead of using the
dispersion relation $\omega(k)$ as the ray hamiltonian. In order to see how the dispersion
relation gives us a different sign, consider the far field solution of a time independent
problem in a uniform medium. The wave equation is

$$ \hat{D}(-i\partial_x, i\partial_t)\,\psi(x,t) = 0. \tag{2.28} $$

The Fourier transform of this gives the integral equation

$$ \int dk\; e^{i(kx - \omega(k)t)}\,D(k, \omega(k))\,\tilde{\psi}(k) = 0. \tag{2.29} $$

This means that the dispersion relation comes from solving $D(k, \omega(k)) = 0$. We can
then analyze the Fourier integral which gives the general solution:

$$ \psi(x,t) = \int dk\; e^{i(kx - \omega(k)t)}\,\tilde{\psi}(k). \tag{2.30} $$

We can find the group velocity as a function of $k$ by using the stationary phase
approximation to estimate this integral. Substitute $x = v_g t$ into the integral, where
$v_g$ is a free parameter, and consider the limit $t \to \infty$. For a choice of $v_g$, the
stationary phase point of this integral is given by

$$ 0 = \frac{\partial}{\partial k}\left( k v_g - \omega(k) \right) = v_g - \frac{\partial \omega}{\partial k}. \tag{2.31} $$

This fixes $k$ given $v_g$, and implies that

$$ \frac{dx}{dt} = v_g = \frac{\partial \omega}{\partial k}. \tag{2.32} $$

Comparing this with Equation (2.17) shows the relative difference in sign. When this
is substituted into the local dispersion relation $\omega(x, k(x))$ to obtain the parametric
equation for $k$, we also find the sign difference:

$$ 0 = \frac{d\omega}{dt} = \frac{\partial \omega}{\partial x}\frac{dx}{dt} + \frac{\partial \omega}{\partial k}\frac{dk}{dt} \tag{2.33} $$

$$ \phantom{0} = \frac{\partial \omega}{\partial x}\left( \frac{\partial \omega}{\partial k} \right) + \frac{\partial \omega}{\partial k}\frac{dk}{dt}, \tag{2.34} $$

$$ \frac{dk}{dt} = -\frac{\partial \omega}{\partial x}. \tag{2.35} $$

Here $d/dt$ is the total derivative following a ray. Although using the dispersion relation
gives a sign difference in the ray equations as compared to the equations obtained from
the dispersion function, the resulting solutions are the same. This is due to the fact
that the important physical object is the Lagrange manifold in phase space, not
the rays which were used to find it. The ray parameter $\sigma$ has no real physical
significance. However, care must be taken when using the ray parameter, and when
relating it to real time via Equation (2.17), so that sign errors are not introduced.
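
The statement that the ray parameter carries no physical significance can be illustrated numerically. In the sketch below, the local dispersion relation $\Omega(x,k) = k^2/2 + x^2/2$ and the family $D = s\,(\omega - \Omega)$ with an arbitrary nonzero constant $s$ are assumptions chosen for illustration. Eliminating $\sigma$ between Equations (2.17) and (2.27) gives the same $dx/dt$ and $dk/dt$ for every $s$, including negative values, and these agree with the rays generated by the dispersion relation.

```python
def local_dispersion_relation(x, k):
    """Illustrative local dispersion relation Omega(x, k) (an assumption)."""
    return 0.5 * k**2 + 0.5 * x**2


def make_dispersion_function(scale):
    """D(x, k, omega) = scale * (omega - Omega(x, k)); same zero set for any scale != 0."""
    def D(x, k, omega):
        return scale * (omega - local_dispersion_relation(x, k))
    return D


def ray_velocities_from_D(D, x, k, omega, h=1e-6):
    """(dx/dt, dk/dt) from Eqs. (2.17) and (2.27), after eliminating sigma."""
    dD_dx = (D(x + h, k, omega) - D(x - h, k, omega)) / (2.0 * h)
    dD_dk = (D(x, k + h, omega) - D(x, k - h, omega)) / (2.0 * h)
    dD_domega = (D(x, k, omega + h) - D(x, k, omega - h)) / (2.0 * h)
    dt_dsigma = dD_domega
    dx_dsigma = -dD_dk
    dk_dsigma = dD_dx
    return dx_dsigma / dt_dsigma, dk_dsigma / dt_dsigma


def ray_velocities_from_relation(x, k, h=1e-6):
    """(dx/dt, dk/dt) = (dOmega/dk, -dOmega/dx), as in Eqs. (2.32) and (2.35)."""
    Omega = local_dispersion_relation
    dOmega_dx = (Omega(x + h, k) - Omega(x - h, k)) / (2.0 * h)
    dOmega_dk = (Omega(x, k + h) - Omega(x, k - h)) / (2.0 * h)
    return dOmega_dk, -dOmega_dx
```

Changing the sign or magnitude of `scale` reverses or rescales the parametrization by $\sigma$, but the velocities with respect to real time, and hence the curve traced in phase space, are unchanged.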



2.2 Eikonal ansatz and the WKB approximation 

The phase space picture introduced in the previous section provides the setting
for the WKB method of constructing approximate solutions to wave equations such
as (2.8). The WKB method starts with the trial solution in Equation (2.18), which is
called an eikonal wave:

$$ \psi(x,t) = A(x,t)\,e^{i(\theta(x) - \omega_0 t)}. \tag{2.36} $$

We are assuming that the real amplitude $A(x,t)$ varies slowly compared to the
phase $\theta(x)$. Since the phase is the only quantity that varies rapidly, the action of
the derivative is approximately the same as multiplication by the derivative of the
phase. In this paper, when we refer to the WKB approximation, this is what we
mean: that the solution is in eikonal form, and the derivative goes to multiplication
by the gradient of the phase.

We now examine the expansion (2.22) of the dispersion function more closely.
Since we are expanding about $(x, k(x))$, the partial derivatives of the dispersion
function may still depend on $x$. This means that when we convert the symbol of
the dispersion function into an operator, we need to symmetrize terms containing
derivatives with respect to $x$. In particular, we get

$$ (\partial_k D)\,\hat{k} \;\to\; \frac{1}{2}\left( (\partial_k D)\,\hat{k} + \hat{k}\,(\partial_k D) \right) \tag{2.37} $$

$$ = -\frac{i}{2}\left( 2(\partial_k D)\,\partial_x + (\partial_x\partial_k D) \right). \tag{2.38} $$

We can now insert this symmetrized term into the expansion in Equation (2.22).
Using the fact that $D(x, k(x); \omega_0) = 0$ and that $\partial_x\theta = k(x)$, we can simplify the
equation to

$$ i(\partial_\omega D)\,\partial_t A - \frac{i}{2}(\partial_x\partial_k D)\,A - i(\partial_k D)\,\partial_x A = 0. \tag{2.39} $$

We can now look for solutions whose only time dependence is in the phase $e^{-i\omega_0 t}$.
This means that we can take $\partial_t A = 0$, and the two remaining terms in the above
equation can be written as a logarithmic derivative:

$$ -\frac{1}{2}\,\partial_x \ln(\partial_k D) = \partial_x \ln A. \tag{2.40} $$

Integrating this along a ray gives the WKB amplitude:

$$ A(x) = A_0 \left[ \partial_k D \big|_{(x, k(x))} \right]^{-1/2}, \tag{2.41} $$

where $A_0$ is a constant of integration. The WKB phase is obtained by integrating
$k(x)$ along a ray:

$$ \theta(x) = \int^{x} k(x')\,dx' = \int k(\sigma)\,\frac{dx}{d\sigma}\,d\sigma. \tag{2.42} $$
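
A short numerical sketch may help make Equations (2.41) and (2.42) concrete. It assumes an illustrative dispersion function $D(x,k;\omega_0) = \omega_0 - k^2/2 - x$ with $\omega_0 = 2$, takes the absolute value of $\partial_k D$ inside the square root, and uses a composite trapezoid rule for the phase integral; none of these choices come from the text.

```python
import math

OMEGA0 = 2.0  # assumption: fixed frequency for this example


def dispersion_function(x, k):
    """Illustrative D(x, k; omega0) = omega0 - k^2/2 - x (an assumption)."""
    return OMEGA0 - 0.5 * k**2 - x


def local_wavenumber(x):
    """Positive root k(x) of D(x, k) = 0, valid for x < OMEGA0."""
    return math.sqrt(2.0 * (OMEGA0 - x))


def wkb_phase(x, x0=0.0, n=2000):
    """theta(x) = integral of k(x') from x0 to x (Eq. 2.42), composite trapezoid."""
    step = (x - x0) / n
    total = 0.5 * (local_wavenumber(x0) + local_wavenumber(x))
    for j in range(1, n):
        total += local_wavenumber(x0 + j * step)
    return total * step


def wkb_amplitude(x, a0=1.0):
    """A(x) = a0 * |dD/dk|^(-1/2) evaluated on the branch k(x) (Eq. 2.41)."""
    dD_dk = -local_wavenumber(x)
    return a0 / math.sqrt(abs(dD_dk))
```

For this model the phase has the closed form $\theta(x) = \tfrac{2\sqrt{2}}{3}\left[(\omega_0 - x_0)^{3/2} - (\omega_0 - x)^{3/2}\right]$, and the product $A^2\,|\partial_k D|$ stays constant along the ray. The amplitude grows without bound as $x \to \omega_0$, where $k(x) \to 0$.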
2.3 Raytracing

The WKB solution presented in Section 2.2 depends on being able to calculate
the Lagrange manifold. From this surface, we can find $k(x)$, which is then integrated
to give the phase function $\theta(x)$. Raytracing is the primary method used to find the
Lagrange manifold. In this section, we give a review of the raytracing method for
the case where the configuration space $x$ is one dimensional.

We start with an approximate solution near the point $x_0$ (see Figure 2.1). The
solution locally has the form

$$ \psi(x,t) = A(x_0)\,e^{i(k_0 x - \omega_0 t)}. \tag{2.43} $$

The wavenumber must satisfy the condition that the dispersion function is zero,
$D(x_0, k_0; \omega_0) = 0$. In general, this equation may have several roots, which correspond
to different wave modes, and so choosing $k_0$ determines which type of solution we
will construct.

Now use the point in phase space $(x_0, k_0)$ as initial conditions for a ray. The ray
trajectory is determined by Equations (2.17) and (2.27). These equations can be
solved (numerically in most cases) to give the curve $(x(\sigma), k(\sigma))$. Since phase space
is only two dimensional in this calculation, this one ray traces out the Lagrange
manifold, whose dimension is half that of phase space. Also, the dispersion surface
has dimension one less than the dimension of phase space, and so this one ray also
traces out the dispersion surface. In general, the dispersion surface is embedded in
phase space, the Lagrange manifold is contained in the dispersion surface, and the
ray is a curve in the Lagrange manifold.

Now that we have the solution $(x(\sigma), k(\sigma))$, we can use it to construct the
WKB solution. As in Equation (2.42), the phase is determined by integrating $k(x)$.
Equation (2.41) gives the amplitude, where the constant of integration is set by our
initial conditions. Putting these together gives the WKB solution

$$ \psi(x,t) = A(x_0)\,\sqrt{\frac{|\partial_k D|_{(x_0, k_0)}}{|\partial_k D|_{(x, k(x))}}}\;\exp\left( i\int_{x_0}^{x} k(x')\,dx' - i\omega_0 t \right). \tag{2.44} $$

FIG. 2.1: Example of a dispersion surface in a two dimensional phase space. In this case,
one ray traces out the Lagrange manifold and the dispersion surface, starting from the
point $(x_0, k_0)$. At the turning point, A, the function $k(x)$ is multivalued, and the WKB
amplitude diverges because $\dot{x} = 0$.

If the ray is parametrized such that $x(0) = x_0$, and $x(\sigma) = x$, then this equation
can be written in terms of the ray parameter as

$$ \psi(x,t) = A(x(0))\,\sqrt{\frac{\dot{x}(0)}{\dot{x}(\sigma)}}\;\exp\left( i\int_{0}^{\sigma} k(\sigma')\,\dot{x}(\sigma')\,d\sigma' - i\omega_0 t \right), \tag{2.45} $$

with

$$ \dot{x}(\sigma) = \frac{dx}{d\sigma}. \tag{2.46} $$

From this form of the equation, it is evident that the WKB amplitude will diverge
if $\dot{x}(\sigma) \to 0$. Such points are called caustics, and occur, for example, at turning
points.
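
The ray form of the WKB solution can be sketched in a few lines of Python. This example is an illustration only: it assumes the dispersion function $D = \omega_0 - k^2/2 - x$ with $\omega_0 = 2$, integrates the ray together with the accumulated phase using a Runge-Kutta step, and evaluates the amplitude factor $\sqrt{|\dot{x}(0)/\dot{x}(\sigma)|}$. The function names are hypothetical.

```python
import math

OMEGA0 = 2.0  # assumption: fixed frequency for this example


def ray_rhs(state):
    """Derivatives of (x, k, theta) with respect to sigma for D = omega0 - k^2/2 - x.

    dx/dsigma = -dD/dk = k, dk/dsigma = dD/dx = -1, dtheta/dsigma = k * dx/dsigma.
    """
    _, k, _ = state
    x_dot = k
    k_dot = -1.0
    return (x_dot, k_dot, k * x_dot)


def rk4_step(state, dsigma):
    """One classical fourth-order Runge-Kutta step for the ray system."""
    def shifted(base, slope, factor):
        return tuple(b + factor * s for b, s in zip(base, slope))

    s1 = ray_rhs(state)
    s2 = ray_rhs(shifted(state, s1, 0.5 * dsigma))
    s3 = ray_rhs(shifted(state, s2, 0.5 * dsigma))
    s4 = ray_rhs(shifted(state, s3, dsigma))
    return tuple(
        y + dsigma * (a + 2.0 * b + 2.0 * c + d) / 6.0
        for y, a, b, c, d in zip(state, s1, s2, s3, s4)
    )


def wkb_along_ray(x0, sigma_end, n_steps=200, a0=1.0):
    """Return (x, k, theta, amplitude) at ray parameter sigma_end."""
    k0 = math.sqrt(2.0 * (OMEGA0 - x0))  # root of D(x0, k0) = 0
    state = (x0, k0, 0.0)
    dsigma = sigma_end / n_steps
    for _ in range(n_steps):
        state = rk4_step(state, dsigma)
    x, k, theta = state
    x_dot_start, x_dot_end = k0, k  # dx/dsigma = k for this model
    amplitude = a0 * math.sqrt(abs(x_dot_start / x_dot_end))
    return x, k, theta, amplitude
```

For this model the ray is known in closed form, $k = k_0 - \sigma$ and $x = x_0 + k_0\sigma - \sigma^2/2$, so the phase is $\theta = [k_0^3 - (k_0 - \sigma)^3]/3$. The turning point sits at $\sigma = k_0$, where $\dot{x} = 0$ and the amplitude factor diverges.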
2.4 WKB Method and Raytracing for Vector Waves



The WKB method for solving multicomponent wave problems is modeled on
the asymptotic methods described in the previous sections for scalar waves.
Multicomponent wave problems arise in many different situations. For example, waves
in a fluid or gas can be written in terms of the (multicomponent) velocity field.
Depending on the types of waves being considered, we may also need to consider
additional scalar fields such as the fluid density or temperature. For plasmas, the
electric and magnetic vector fields also need to be considered. Gathering all these
components together would in general give us a wave equation in the form of an
$N \times N$ matrix operator acting on the components of the field. For example, a
two component wave equation can be written as

$$ \hat{\mathbf{D}} \cdot \boldsymbol{\psi}(x,t) = \begin{pmatrix} \hat{D}_{11} & \hat{D}_{12} \\ \hat{D}_{21} & \hat{D}_{22} \end{pmatrix} \begin{pmatrix} \psi_1(x,t) \\ \psi_2(x,t) \end{pmatrix} = 0, \tag{2.47} $$

where $\hat{\mathbf{D}}$ is the wave operator (a $2 \times 2$ matrix of operators), and $\boldsymbol{\psi}(x,t)$ is the field
(a 2-component vector).

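We can now introduce a vector form of the eikonal wave, and include a slowly
varying polarization vector $\hat{\mathbf{e}}(x)$:

$$ \boldsymbol{\psi}(x,t) = \hat{\mathbf{e}}(x)\,A(x)\,e^{i(\theta(x) - \omega_0 t)}. \tag{2.48} $$

The wave operator for vector problems is a matrix of operators, and its symbol is
called the dispersion matrix. For the two-component case $N = 2$, the dispersion
matrix is a $2 \times 2$ matrix:

$$ \mathbf{D}(x, k; \omega_0) = \begin{pmatrix} D_{11}(x, k; \omega_0) & D_{12}(x, k; \omega_0) \\ D_{21}(x, k; \omega_0) & D_{22}(x, k; \omega_0) \end{pmatrix}. \tag{2.49} $$

This matrix-valued function on phase space has local eigenvectors and eigenvalues
which are also functions of phase space:

$$ \mathbf{D}(x, k; \omega_0) \cdot \hat{\mathbf{e}}_\alpha(x, k; \omega_0) = D_\alpha(x, k; \omega_0)\,\hat{\mathbf{e}}_\alpha(x, k; \omega_0), \qquad \alpha = 1, 2, \ldots, N. \tag{2.50} $$

The dispersion matrix can be expanded about a point just as the dispersion function
was in Equation (2.12):

$$ \mathbf{D}(x, k; \omega) = \mathbf{D}(x_0, k_0; \omega_0) + \frac{\partial \mathbf{D}}{\partial x}(x - x_0) + \frac{\partial \mathbf{D}}{\partial k}(k - k_0) + \frac{\partial \mathbf{D}}{\partial \omega}(\omega - \omega_0) + \ldots. \tag{2.51} $$

When converted back to an operator, the first term of this series gives the equation

$$ \mathbf{D}(x_0, k_0; \omega_0)\,\boldsymbol{\psi}(x,t) = 0. \tag{2.52} $$

This can be written using the eigenvectors of $\mathbf{D}(x_0, k_0; \omega_0)$ as a basis, which gives
(suppressing the $\omega_0$ dependence in $\mathbf{D}$ and $\hat{\mathbf{e}}_\alpha$, for convenience)

$$ \sum_{\alpha=1}^{N} \mathbf{D}(x_0, k_0)\,\hat{\mathbf{e}}_\alpha(x_0, k_0)\,A_\alpha(x)\,e^{i(\theta_\alpha(x) - \omega_0 t)} = \sum_{\alpha=1}^{N} D_\alpha(x_0, k_0)\,\hat{\mathbf{e}}_\alpha(x_0, k_0)\,A_\alpha(x)\,e^{i(\theta_\alpha(x) - \omega_0 t)}. \tag{2.53} $$

This equation implies that in general, if $\boldsymbol{\psi}$ is going to be a nontrivial solution,
then its polarization vector $\hat{\mathbf{e}}(x)$ must be an eigenvector of the matrix $\mathbf{D}$ with zero
eigenvalue.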

We now have a situation comparable to the scalar case discussed previously.
Each function $D_\alpha(x, k; \omega_0)$ defines a dispersion surface, and, as long as the dispersion
surfaces for different polarizations are well separated in phase space, they each
correspond to a different wave mode. The eigenvalues $D_\alpha$ can then act as ray
hamiltonians, and Equation (2.45) can be used to construct the WKB amplitude
and phase corresponding to this polarization.¹

¹The local eigenvectors of the dispersion matrix uniquely define the polarizations, modulo a
non-holonomic phase.

If the dispersion surfaces for different modes come near each other in phase
space, this implies that the dispersion matrix is nearly degenerate. Using the
eigenvectors as polarizations for the different modes means that we are implicitly
diagonalizing the dispersion matrix. This diagonalization procedure would become
undefined in the case of degenerate eigenvalues. For the nearly degenerate case when the
dispersion surfaces are close to each other, this implies that the eigenvectors could
be rapidly varying. If the eigenvectors vary rapidly, then the ordering assumptions
used to construct the WKB solutions would no longer be valid. A region where such
a breakdown occurs is known as a mode conversion region, since energy initially
in one of the wave modes can be transferred into the other wave mode. This
phenomenon is known by different names in different fields, e.g., a Landau-Zener level
crossing or avoided crossing, linear wave conversion, surface hopping,
and resonance crossing. In the next chapter we will examine a simple model
which exhibits this behavior, and describe the mathematical tools used to deal with
the mode conversion region in the context of ray-tracing algorithms.



CHAPTER 3 



Coupled Oscillators: A Pedagogical 
Introduction 

3.1 The Wave — Oscillator Analogy 

Problems exhibiting mode conversion arise in wave systems where the
background medium is spatially varying. Consider, for example, the wave equation in
conservation form

$$\partial_t^2 \psi(x,t) - \partial_x \left( c^2(x)\, \partial_x \psi(x,t) \right) = 0. \tag{3.1}$$

Here, the background variation is modeled via a wave speed which depends on
position. Since there is no explicit time dependence, we can Fourier transform from
$t$ to $\omega$:

$$-\omega^2 \psi(x) - \frac{d}{dx}\left( c^2(x) \frac{d}{dx} \psi(x) \right) = 0. \tag{3.2}$$

This is an equation for the spatial profile of each frequency component. If we define
the wavenumber $k(x) = \omega/c(x)$, we get

$$k^2(x)\, \psi(x) + \frac{1}{c^2(x)} \left( \frac{d}{dx} c^2(x) \right) \frac{d}{dx} \psi(x) + \frac{d^2}{dx^2} \psi(x) = 0. \tag{3.3}$$

If we can ignore the term with the derivative of $c(x)$, then this equation is similar
in form to the equation for a harmonic oscillator with a varying natural frequency:

$$\ddot{x}(t) + \omega^2(t)\, x(t) = 0. \tag{3.4}$$

While the change of variables from $(x, k(x))$ to $(t, \omega(t))$ is somewhat awkward
because of the previous use of the variables $t$ and $\omega$, it is worthwhile since it lets us
think of the original wave problem in terms of a simple harmonic oscillator, which
is a common and well understood system.

In this chapter we will analyze a pair of coupled oscillators with time-dependent
frequencies. This pair of oscillators provides a simple example of the phase space
techniques used to solve the mode conversion problem. There are two aspects of this
setup which deserve special note, since they differ from standard harmonic oscillator
problems.

First, in this problem phase space is taken to mean the time-frequency $(t, \omega)$
plane. Thus the natural frequencies of the oscillators describe curves in phase space.
Also, the Fourier transform of the solution

(3.5)

is related to a rotation of phase space by 90°. (This connection of the Fourier
transform to a rotation in phase space will be discussed at length in following chapters.)
Unless otherwise noted, when the term phase space is used in the following, the $(t, \omega)$
plane is meant rather than the $(x, \dot{x})$ phase space of classical mechanics problems.

The second aspect of this calculation which may be unusual is the use of complex
values for $x(t)$. The displacement of a physical oscillator is a real valued quantity.
However, if we use Fourier methods to solve the problem, the most general solutions
will be complex valued. The "physical" solution is then obtained by taking the
real part of $x(t)$. The Fourier method of expanding our field in complex valued
plane waves arises naturally in this context. In general, the wave equations we want
to solve will take the form $\hat{D}(x, -i\partial_x; t, i\partial_t)\, \psi(x,t) = 0$. Since $\hat{D}$ is a function of
the operators $-i\partial_x$ and $i\partial_t$, we would like to expand it in eigenfunctions of these
operators if possible. In the best case scenario, this will diagonalize $\hat{D}$, and give
us the solutions to our problem. The eigenfunctions of these derivative operators
are the complex functions $e^{ikx}$ and $e^{-i\omega t}$, so a solution written with these as basis
functions will in general be complex valued.

With these two caveats in mind, we now proceed to the actual calculation. 

3.2 Outline of Calculation 

The outline of this calculation is as follows. First, we review the problem of 
two coupled oscillators, and discuss how the WKB approximation and raytracing 
methods can be used to solve the problem even when the natural frequencies of the 
oscillators are time dependent. 

Then, we discuss how these methods break down when the frequencies of the
oscillators become nearly equal for some range of values, allowing energy to transfer
from one oscillator to the other during this resonance crossing. We outline how we
can use asymptotic methods to deal with this phenomenon as if it were a scattering
or ray splitting problem.

We then use two different methods to find the local solution in the resonance
region. We first show how a linearization near the resonance gives us the equation
for the parabolic cylinder functions (this is a well known result). Then, we show how
a linear canonical transformation of $(t, \omega)$ phase space, along with the associated
metaplectic transformation of our solutions (to be defined), gives a more intuitive
picture of the linearization procedure. This phase space picture also allows the
scattering parameters to be derived more easily from the local solutions.

Finally, we compare our full raytracing solution (including the ray splitting at
the mode conversion) with numerical solutions of the original equations.



3.3 Coupled Oscillator Review 

3.3.1 Problem Setup 

The problem we will examine is that of energy transfer between two coupled
harmonic oscillators. We start with the equation of motion for a harmonic oscillator,
such as Equation (3.4), and introduce a second oscillator coupled to the first:

$$\ddot{x}_1 + \omega_1^2(t)\, x_1 + \eta\, x_2 = 0, \tag{3.6}$$
$$\ddot{x}_2 + \omega_2^2(t)\, x_2 + \eta\, x_1 = 0. \tag{3.7}$$

Here $x_i(t)$ is the state of the $i$th oscillator, $\omega_i(t)$ is the natural frequency of the
$i$th oscillator given as a function of time, and the constant coupling between the
oscillators is given by the coefficient $\eta$. In many real coupled oscillator problems,
the coupling would be proportional to a function of the difference between the
oscillators' positions, e.g., $\eta\,(x_2 - x_1)$. However, since we are trying to model mode
conversion in waves, we will use the form above, without the difference, since that
is how the coupling appears in many mode conversion models [5]. This can be
understood as a shift in the definitions of $\omega_1$ and $\omega_2$.

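Equations (3.6) and (3.7) can be derived from an action principle

$$\mathcal{A} = \int dt\; \mathbf{x}^\dagger(t) \cdot \hat{\mathbf{D}}(t, i\partial_t) \cdot \mathbf{x}(t). \tag{3.8}$$

Variation of this action with respect to $\mathbf{x}^\dagger$ gives the matrix form of the equations,

$$\hat{\mathbf{D}} \cdot \mathbf{x} = \begin{pmatrix} \omega_1^2 + \frac{d^2}{dt^2} & \eta \\ \eta & \omega_2^2 + \frac{d^2}{dt^2} \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \end{pmatrix} = 0. \tag{3.9}$$

The operator $\hat{\mathbf{D}}$ can be written in terms of functions of the phase space variables
$(t, \omega)$ by calculating the "symbol" of the wave operator through the substitution^2
$i\partial_t \to \omega$. A full description of symbols of operators will be given in Chapter 8.

$$\mathbf{D}(t,\omega) = \begin{pmatrix} \omega_1^2 - \omega^2 & \eta \\ \eta & \omega_2^2 - \omega^2 \end{pmatrix}. \tag{3.10}$$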

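The equations of motion now have the form of an eigenvalue equation for a pair
of coupled harmonic oscillators with slowly varying natural frequencies. In terms
of the mass matrix $\mathbf{M}$ and the potential matrix $\mathbf{V}$ from classical mechanics [7],
Equation (3.9) is

$$\mathbf{D} \cdot \mathbf{x} = (\mathbf{V} - \mathbf{M}\omega^2) \cdot \mathbf{x} = 0, \tag{3.11}$$

with

$$\mathbf{M} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \qquad \mathbf{V} = \begin{pmatrix} \omega_1^2 & 0 \\ 0 & \omega_2^2 \end{pmatrix} + \eta \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}. \tag{3.12}$$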





^2 The sign in this substitution is set by considering the action of the derivative on a positive
frequency plane wave. Since a plane wave is usually written as $e^{i(kx - \omega t)}$, we get
$i\partial_t e^{i(kx - \omega t)} = \omega\, e^{i(kx - \omega t)}$, which gives us the sign shown above.

3.3.2 Constant Natural Frequencies

Before proceeding to examine mode conversion in this problem, notice that the
system in Equation (3.9) can be solved exactly if the natural frequencies of the
oscillators and the coupling are all constants. In this case, there are two eigenmodes
that can be found by setting $\det(\mathbf{D}) = 0$. Solving for the eigenfrequencies gives

$$\omega_\pm^2 = \frac{1}{2}\left(\omega_1^2 + \omega_2^2\right) \pm \frac{1}{2}\sqrt{\left(\omega_1^2 - \omega_2^2\right)^2 + 4\eta^2}. \tag{3.13}$$

There is an eigenvector associated with each of these modes. These eigenvectors are

$$\mathbf{e}_\pm = \mathcal{N}_\pm \begin{pmatrix} \eta\,(\omega_\pm^2 - \omega_1^2)^{-1} \\ 1 \end{pmatrix}, \tag{3.14}$$

where $\mathcal{N}_\pm$ is the normalization. The most general solution for $\mathbf{x}(t)$ is then given by

$$\mathbf{x}(t) = C_+ \mathbf{e}_+ e^{-i(\omega_+ t + \phi_+)} + C_- \mathbf{e}_- e^{-i(\omega_- t + \phi_-)}, \tag{3.15}$$

where the real amplitudes $C_\pm$ and phases $\phi_\pm$ depend on the initial conditions.
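As a quick numerical check of Equations (3.13) and (3.14), the following sketch evaluates the eigenfrequencies for constant parameters and confirms that they satisfy $\det(\mathbf{D}) = 0$, and that the unnormalized eigenvectors are annihilated by the dispersion matrix of Equation (3.10). The function names and the numerical values of `w1`, `w2` and `eta` are illustrative assumptions, not values taken from the text.

```python
import math

def eigenfrequencies_squared(w1, w2, eta):
    """Return (w_plus^2, w_minus^2), the two roots of det D = 0."""
    mean = 0.5 * (w1**2 + w2**2)
    half_gap = 0.5 * math.sqrt((w1**2 - w2**2) ** 2 + 4.0 * eta**2)
    return mean + half_gap, mean - half_gap

def det_dispersion(w1, w2, eta, w_sq):
    """Determinant of the 2x2 symbol D, written as a function of w^2."""
    return (w1**2 - w_sq) * (w2**2 - w_sq) - eta**2

def eigenvector(w1, eta, w_sq):
    """Unnormalized eigenvector (eta / (w^2 - w1^2), 1)."""
    return (eta / (w_sq - w1**2), 1.0)

def apply_dispersion(w1, w2, eta, w_sq, vec):
    """Multiply the symbol D (evaluated at w^2) into a 2-vector."""
    a, b = vec
    return ((w1**2 - w_sq) * a + eta * b, eta * a + (w2**2 - w_sq) * b)

# Illustrative (assumed) constant parameters.
w1, w2, eta = 1.2, 1.0, 0.05
wp_sq, wm_sq = eigenfrequencies_squared(w1, w2, eta)

residual_det = [det_dispersion(w1, w2, eta, s) for s in (wp_sq, wm_sq)]
residual_vec = [apply_dispersion(w1, w2, eta, s, eigenvector(w1, eta, s))
                for s in (wp_sq, wm_sq)]
```

Both residual lists should vanish to rounding error, which is a convenient sanity check before the frequencies are allowed to vary in time.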

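Later in this calculation we will encounter the limiting case where the coupling
is small and the natural frequencies of the oscillators are significantly different. In
this case,

$$|\eta| \ll |\omega_1^2 - \omega_2^2| = |\Delta|. \tag{3.16}$$

The eigenfrequencies are then

$$\omega_+^2 \approx \omega_>^2 + \frac{\eta^2}{|\Delta|}, \tag{3.17}$$
$$\omega_-^2 \approx \omega_<^2 - \frac{\eta^2}{|\Delta|}, \tag{3.18}$$

where $\omega_>$ ($\omega_<$) is the larger (smaller) of the natural frequencies $\omega_1$ and $\omega_2$. If
$\omega_1 = \omega_>$, then the eigenvectors are approximately

$$\mathbf{e}_+ \approx \begin{pmatrix} 1 \\ \epsilon \end{pmatrix}, \qquad \mathbf{e}_- \approx \begin{pmatrix} -\epsilon \\ 1 \end{pmatrix}, \tag{3.19}$$

where

$$\epsilon = \frac{\eta}{\Delta}. \tag{3.20}$$

If $\omega_2 = \omega_>$ then we get

$$\mathbf{e}_+ \approx \begin{pmatrix} -\epsilon \\ 1 \end{pmatrix}, \qquad \mathbf{e}_- \approx \begin{pmatrix} 1 \\ \epsilon \end{pmatrix}, \qquad \omega_2 = \omega_>. \tag{3.21}$$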

If we start with all the energy in the first oscillator, that is equivalent to
specifying the initial condition

$$\mathbf{x}(t = 0) = \mathbf{x}_0 = A \begin{pmatrix} 1 \\ 0 \end{pmatrix}. \tag{3.22}$$

Assuming that $\omega_1 = \omega_>$, we now decompose this onto the eigenmodes, and find the
initial amplitudes and phases

$$\mathbf{x}_0 = A \left( (1 - |\epsilon|^2)\, \mathbf{e}_+ - \epsilon\, \mathbf{e}_- \right), \qquad \phi_+ = \phi_- = 0. \tag{3.23}$$

This means that the second oscillator is driven by the first and its solution is

$$x_2(t) = A \epsilon \left( (1 - |\epsilon|^2)\, e^{-i\omega_+ t} - e^{-i\omega_- t} \right). \tag{3.24}$$

The maximum amplitude that $x_2$ can take on is approximately

$$\max(x_2(t)) \approx 2A|\epsilon| = 2A \left| \frac{\eta}{\Delta} \right|. \tag{3.25}$$

Since we are in the limit where $|\epsilon| \ll 1$, the magnitude of $x_2$ is always small, so
there is not much energy transferred between the oscillators. In this case, a good
approximation to the solution is

$$\mathbf{x}(t) \approx A \begin{pmatrix} 1 \\ 0 \end{pmatrix} e^{-i\omega_1 t}. \tag{3.26}$$
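The weak-coupling estimates can be checked against the exact constant-frequency solution. The sketch below compares the exact upper eigenfrequency with the estimate $\omega_>^2 + \eta^2/|\Delta|$, and the exact maximum of $|x_2(t)|$ with the estimate $2A|\eta/\Delta|$, for a system started with all the energy in the first oscillator. The parameter values, the sampling grid and the helper names are assumptions chosen for illustration only.

```python
import cmath
import math

def exact_modes(w1, w2, eta):
    """Exact eigenfrequencies (w_plus, w_minus) for constant w1, w2, eta."""
    mean = 0.5 * (w1**2 + w2**2)
    gap = 0.5 * math.sqrt((w1**2 - w2**2) ** 2 + 4.0 * eta**2)
    return math.sqrt(mean + gap), math.sqrt(mean - gap)

def max_x2(w1, w2, eta, amp=1.0, t_max=100.0, steps=10000):
    """Largest |x2(t)| on a time grid when x(0) = amp * (1, 0)."""
    wp, wm = exact_modes(w1, w2, eta)
    # First components of the unnormalized eigenvectors (a, 1).
    a_plus = eta / (wp**2 - w1**2)
    a_minus = eta / (wm**2 - w1**2)
    # Solve c_plus*(a_plus, 1) + c_minus*(a_minus, 1) = (amp, 0).
    c_plus = amp / (a_plus - a_minus)
    c_minus = -c_plus
    best = 0.0
    for n in range(steps + 1):
        t = t_max * n / steps
        x2 = c_plus * cmath.exp(-1j * wp * t) + c_minus * cmath.exp(-1j * wm * t)
        best = max(best, abs(x2))
    return best

# Illustrative (assumed) parameters with |eta| << |Delta|, and w1 the larger frequency.
w1, w2, eta, amp = 1.2, 1.0, 0.05, 1.0
delta = w1**2 - w2**2

wp, _ = exact_modes(w1, w2, eta)
wp_sq_estimate = w1**2 + eta**2 / abs(delta)
x2_max_exact = max_x2(w1, w2, eta, amp)
x2_max_estimate = 2.0 * amp * abs(eta / delta)
```

For these values the two estimates agree with the exact results to within a few percent, with the discrepancy shrinking as $|\eta/\Delta|$ is reduced.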



3.3.3 Varying the Frequencies $\omega_1$ and $\omega_2$

In order to model a mode conversion problem using coupled oscillators, they 
need to be set up so that their time-dependent natural frequencies become nearly 
equal at some time. Figure 3.1 gives a plot of the frequencies for such a system. In
that figure, the dashed lines are the natural frequencies of the oscillators. Near time
$t = 0$ the frequencies are approximately linear functions of time, and at time $t = 0$
they are equal. The dash-dotted lines are the eigenfrequencies, $\omega_\pm$, of the dispersion
matrix. Far from the crossing, the eigenfrequencies are approximately equal to the 
specified natural frequencies of the two oscillators. Near t = 0, the eigenfrequencies 
form a hyperbolic structure called an avoided crossing. 

In any matrix problem, there is inherently a question as to which set of basis 
vectors should be used when formulating the problem and its solution. Do you stick
with the physical coordinates, or switch to a more appropriate set of generalized
coordinates? Or, asked a different way, which set of polarizations should be used to
describe the wave? In many problems, the eigenvectors of the matrix wave operator 
are the natural basis vectors to use. For example, the "up" and "down" spin states 
correspond to energy eigenstates which are aligned with an applied magnetic field. 
Any other spin state can be written as a combination of these. Similarly, in a linear 
mode conversion problem, the two interacting modes provide a set of states which 
would seem natural to use as basis states. 

In our example of two coupled harmonic oscillators, each oscillator Xi has an 
amplitude that must be specified in order to give the complete state of the system. 
Using these two amplitudes as components of the state vector gives expressions such
as Equation (3.9). However, as illustrated by the calculation in Section 3.3.2, it is
often easier to solve the system if we choose a different set of basis vectors. In that 
section, by using the eigenvectors of the dispersion matrix as a basis, the solution 
decoupled into normal modes with distinct frequencies. It would seem that this 
normal mode basis would be the one to use when solving the full problem including 
mode conversion. (An aside concerning nomenclature; in the atomic scattering 
literature, the eigenvector basis with coupling is called the adiabatic basis, and the 
original basis without coupling is called the diabatic basis.) 

There are several problems with using the eigenvectors as a basis, however. 
First, there is the question of where exactly the mode conversion occurs. As
seen in Figure 3.1, while the given natural frequencies $(\omega_1, \omega_2)$ cross at a well
defined point in phase space, the coupled frequencies $(\omega_+, \omega_-)$ do not cross. This
avoided crossing makes it more difficult to define the region in which mode conversion 
is occurring. 

Another problem with this basis arises when trying to define the polarization
transport when using raytracing to solve the problem. The WKB approximation
assumes that the various quantities associated with the ray and the surrounding
medium are slowly varying. However, near a mode conversion point, the polarization
starts to vary rapidly. An involved calculation then becomes necessary to find
the polarization of the transmitted and converted rays. Granted, the WKB
approximation is not valid near the mode conversion anyway, but it would make the
calculation simpler if the polarization varied smoothly and simply across the mode
conversion region.

FIG. 3.1: This plot shows the natural frequencies of the oscillators (dashed lines) and the
eigenfrequencies of the pair of coupled oscillators (solid lines). The coupled frequencies
are obtained by solving $\det(\mathbf{D}(t, \omega)) = 0$. Far from the mode conversion, the natural
frequencies are approximately equal to the eigenfrequencies. In the mode conversion
region, the effect of the coupling is more pronounced, giving rise to the avoided crossing
structure.



3.4 The WKB approximation and Ray tracing 

3.4.1 Eikonal ansatz and the WKB approximation 

In the previous sections, the solution to our problem was obtained for coupled
oscillators with fixed natural frequencies. We now consider the possibility that
these frequencies are not fixed, but could be slowly varying functions of time. If the
frequencies vary slowly enough, then we expect the solutions to look very similar to
the plane wave solutions in Equation (3.15), for at least a few oscillation periods.
A trial solution of this form is called an eikonal ansatz. Let's assume that the
solution will have this eikonal form

$$\mathbf{x}(t) = \mathbf{e}(t) A(t) e^{i\theta(t)}, \tag{3.27}$$

where the polarization $\mathbf{e}(t)$ and amplitude $A(t)$ vary slowly compared to the phase
$\theta(t)$. So the derivative of this with respect to time is approximately

$$i \frac{d}{dt} \mathbf{x}(t) \approx -\mathbf{e}(t) A(t)\, \dot{\theta}(t)\, e^{i\theta(t)}. \tag{3.28}$$

Note that this eikonal form has an ambiguity in the phase of the polarization. Any
change to the phase of the polarization can be absorbed by the phase $e^{i\theta(t)}$:

$$e^{i\theta(t)} \mathbf{e}(t) = e^{i(\theta(t) + \gamma)} e^{-i\gamma} \mathbf{e}(t) = e^{i\theta'(t)} \mathbf{e}'(t). \tag{3.29}$$

We adopt the convention that there is an arbitrary initial phase in the polarization
vector, but any change in phase appears through variations in $\theta(t)$.

We can now examine the action of our wave operator on our test solution. First,
assume that the coupling is turned off. In this case, the polarization corresponding
to either of the oscillators separately is an eigenvector of the matrix $\mathbf{D}$. In general,
the time derivative brings down a derivative of the phase:

$$\hat{\mathbf{D}} \cdot \mathbf{x}(t) = \hat{\mathbf{D}}(t, i\partial_t) \cdot \mathbf{e}_\alpha(t) A(t) e^{i\theta(t)} \approx \mathbf{e}_\alpha(t) A(t)\, D_\alpha(t, -\dot{\theta}(t))\, e^{i\theta(t)}. \tag{3.30}$$

Here, $D_\alpha(t, -\dot{\theta})$ is the eigenvalue of the matrix $\mathbf{D}$, where the time derivatives have
been replaced by $\dot{\theta}$, turning the operator into a function of $t$. In order to obtain a
solution, we need

$$D_\alpha(t, -\dot{\theta}(t)) = 0. \tag{3.31}$$

For example, if we are considering solutions for $x_1(t)$ in the limit $\eta \to 0$, then we
take the first diagonal element of $\mathbf{D}$, and replace the derivatives with $\dot{\theta}$ to obtain

$$\mathbf{e}_1^\dagger \cdot \mathbf{D} \cdot \mathbf{e}_1 = D_{11}(t, -\dot{\theta}(t)) = \omega_1^2(t) - \dot{\theta}^2(t) = 0. \tag{3.32}$$

This implies that the phase is given by the integral of the natural frequency:

$$\theta(t) = -\int^{t} \omega_1(t')\, dt'. \tag{3.33}$$

The amplitude as a function of time can now be derived from an approximate
action principle for the wave equation. Insert the approximation from Equation
(3.30) into the action in Equation (3.8) to obtain a new approximate action

$$\mathcal{A} = \int dt\; D_{11}(t, \omega = -\dot{\theta})\, |A(t)|^2. \tag{3.34}$$

This action depends on $\theta(t)$ only through its derivative, and so the action is not
changed by adding an arbitrary constant to $\theta$. This means that there is a Noether
symmetry, and an associated conserved quantity. Varying with respect to $\theta$ and
assuming that $D_{11}$ is a smooth function of $\omega$ implies that

$$\frac{d}{dt} \left( \left. \frac{\partial D_{11}}{\partial \omega} \right|_{t,\, \omega_1(t)} |A(t)|^2 \right) = 0. \tag{3.35}$$

For the problem being considered here, Equation (3.10) gives us $D_{11} = (\omega_1^2 - \omega^2)$,
which means that

$$\omega_1(t)\, |A(t)|^2 = \text{const}. \tag{3.36}$$

This means that the amplitude of $x_1$ must vary like

$$A(t) = A^{in} \left( \frac{\omega_1^{in}}{\omega_1(t)} \right)^{1/2}, \tag{3.37}$$

where $A^{in}$ and $\omega_1^{in}$ are constants. Together, Equations (3.37) and (3.33) give the
WKB solutions for the amplitude and phase of the oscillator. The polarization vector
$\mathbf{e}(t)$ is given by the instantaneous eigenvector $\mathbf{e}_\alpha(t)$ associated with the eigenvalue
we are considering. In order to fully define the polarization, the slowly-varying phase
$\gamma$ of the polarization must also be defined:

$$\mathbf{e}(t) = e^{i\gamma(t)} \mathbf{e}_\alpha(t). \tag{3.38}$$

The equation for the transport of this phase along a ray can be calculated from the
next higher order terms which were dropped from Equation (3.30). For details of
this calculation, see [8].
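The conservation law in Equation (3.36) can be illustrated numerically for a single uncoupled oscillator. The sketch below integrates $\ddot{x} + \omega^2(t) x = 0$ with a hand-written fourth-order Runge-Kutta step, starting from a positive-frequency eikonal state, and compares $\omega |x|^2$ at the start and end of a slow frequency ramp. The tanh frequency profile, the ramp time `T`, the step size and all function names are assumptions made for this example; they are not the values used elsewhere in this chapter.

```python
import math

def omega(t, w_lo=1.0, w_hi=2.0, T=50.0):
    """Assumed slowly varying natural frequency (offset tanh ramp)."""
    return 0.5 * (w_lo + w_hi) + 0.5 * (w_hi - w_lo) * math.tanh(t / T)

def rhs(t, x, v):
    """First-order form of x'' + omega(t)^2 x = 0."""
    return v, -omega(t) ** 2 * x

def integrate(t0, t1, dt):
    """RK4 integration of a complex oscillator started as exp(-i*omega*t)."""
    x = complex(1.0, 0.0)
    v = -1j * omega(t0) * x          # positive-frequency initial condition
    t = t0
    for _ in range(int(round((t1 - t0) / dt))):
        k1x, k1v = rhs(t, x, v)
        k2x, k2v = rhs(t + dt / 2, x + dt / 2 * k1x, v + dt / 2 * k1v)
        k3x, k3v = rhs(t + dt / 2, x + dt / 2 * k2x, v + dt / 2 * k2v)
        k4x, k4v = rhs(t + dt, x + dt * k3x, v + dt * k3v)
        x += dt / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += dt / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        t += dt
    return x, v

t0, t1 = -200.0, 200.0
x_end, v_end = integrate(t0, t1, 0.02)

# WKB predicts omega * |x|^2 is conserved, so |x| shrinks as omega grows.
invariant_ratio = omega(t1) * abs(x_end) ** 2 / (omega(t0) * 1.0)
wkb_amplitude = math.sqrt(omega(t0) / omega(t1))
```

When the ramp time is long compared with the oscillation period, `invariant_ratio` stays very close to one and `abs(x_end)` tracks `wkb_amplitude`; shortening `T` makes the deviation from the WKB prediction visible.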



3.4.2 Raytracing 

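The idea behind the raytracing method is to recast the wave problem into the
form of a Hamiltonian mechanics problem. There are many ways to do this. We
will use the tools of the "symbol calculus" (see Chapter 8 and also S. MacDonald's
excellent review paper [9]). In this section, we outline the basic ideas behind the
ray-tracing method.

Start by considering the instantaneous eigenvectors $\mathbf{e}_\alpha(t)$, as in the previous
section. Use these vectors to decompose an eikonal wave:

$$\psi(t) = \sum_\alpha \mathbf{e}_\alpha(t) A_\alpha(t) e^{i\theta_\alpha(t)/\epsilon}. \tag{3.39}$$

Here we have introduced $\epsilon$ as an ordering parameter. In the following we will consider
the polarizations separately. Instead of using an approximation as in Equation
(3.30), we can use the ordering parameter to calculate the action of a derivative:

$$\epsilon\, i\partial_t\, \mathbf{e}_\alpha(t) A_\alpha(t) e^{i\theta_\alpha(t)/\epsilon} = e^{i\theta_\alpha(t)/\epsilon} \left( -\dot{\theta}_\alpha + \epsilon\, i\partial_t \right) \mathbf{e}_\alpha(t) A_\alpha(t). \tag{3.40}$$

Substitute this expression into the wave equation:

$$\hat{\mathbf{D}}(t, \epsilon\, i\partial_t)\, \mathbf{e}_\alpha(t) A_\alpha(t) e^{i\theta_\alpha(t)/\epsilon} = e^{i\theta_\alpha(t)/\epsilon}\, \hat{\mathbf{D}}(t, -\dot{\theta} + \epsilon\, i\partial_t)\, \mathbf{e}_\alpha(t) A_\alpha(t) = 0. \tag{3.41}$$

Use the fact that $\mathbf{e}_\alpha$ is an eigenvector to construct a scalar wave operator:

$$\hat{D}_\alpha = \mathbf{e}_\alpha^\dagger \cdot \hat{\mathbf{D}} \cdot \mathbf{e}_\alpha. \tag{3.42}$$

This gives a scalar equation for the mode $\alpha$:

$$\hat{D}_\alpha(t, -\dot{\theta} + \epsilon\, i\partial_t)\, A_\alpha(t) = 0. \tag{3.43}$$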

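The WKB approximation says that the magnitude of $\dot{\theta}(t)$ is larger than the
magnitude of any other derivatives that appear in this equation, and the ordering
parameter $\epsilon$ makes this explicit. If we could think of the derivative as a variable instead
of as a function, then we could use the Taylor series expansion to obtain an
approximation to this equation. This can in fact be done by using the symbol of the
wave operator, $D_\alpha(t, \omega)$, as in Equation (3.10). Expand the symbol about $\omega = -\dot{\theta}$
using the Taylor series, and then convert it back into an operator. Since $\partial_\omega D_\alpha$ is
a function of time, the ordering of the multiplication and differentiation operators
must be symmetrized. We therefore have

$$\hat{D}_\alpha(t, -\dot{\theta} + \epsilon\, i\partial_t) = D_\alpha(t, -\dot{\theta}) + \frac{\epsilon}{2} \left[ \frac{\partial D_\alpha}{\partial \omega} (i\partial_t) + (i\partial_t) \frac{\partial D_\alpha}{\partial \omega} \right] + \ldots \tag{3.44}$$
$$= D_\alpha(t, -\dot{\theta}) + \epsilon\, i\, \frac{\partial D_\alpha}{\partial \omega} \frac{d}{dt} + \frac{\epsilon\, i}{2} \left( \frac{d}{dt} \frac{\partial D_\alpha}{\partial \omega} \right) + \ldots \tag{3.45}$$

Truncating this series at $O(\epsilon)$, and putting it into Equation (3.43) gives us

$$D_\alpha(t, -\dot{\theta})\, A(t) + i\epsilon \left. \left( \frac{\partial D_\alpha}{\partial \omega} \right)^{1/2} \right|_{\omega = -\dot{\theta}} \frac{d}{dt} \left( \left( \frac{\partial D_\alpha}{\partial \omega} \right)^{1/2} A(t) \right) = 0. \tag{3.46}$$

In order for this to equal zero, both terms must be zero. This gives us equations for
$\theta(t)$ and $A(t)$:

$$D_\alpha(t, -\dot{\theta}) = 0, \tag{3.47}$$
$$\frac{d}{dt} \left( \left| \frac{\partial D_\alpha}{\partial \omega} \right|^{1/2} A(t) \right) = 0. \tag{3.48}$$

The first of these equations is solved using ray-tracing techniques, while the second
equation gives us the standard WKB formula for the amplitude.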

Because $D_\alpha$ is a function on phase space, we are looking for the curve in the
$(t, \omega)$ plane along which $D_\alpha(t, \omega)$ is zero. If we write a parametric equation for this
curve, then we will have the "ray" $(t(\sigma), \omega(\sigma))$, where $\sigma$ is the ray parameter, which
plays the role of time from classical mechanics.

If we start with a point in phase space such that $D_\alpha(t_0, \omega_0) = 0$, then the
equations of motion for the ray can be derived by enforcing the condition that $D_\alpha$
remains constant along the ray:

$$\frac{dD_\alpha}{d\sigma} = \frac{\partial D_\alpha}{\partial t} \frac{dt}{d\sigma} + \frac{\partial D_\alpha}{\partial \omega} \frac{d\omega}{d\sigma} = 0. \tag{3.49}$$

We can choose the parameterization as we like, so choose

$$\frac{dt}{d\sigma} = \frac{\partial D_\alpha}{\partial \omega}. \tag{3.50}$$

That leaves the equation

$$\frac{\partial D_\alpha}{\partial \omega} \left( \frac{\partial D_\alpha}{\partial t} + \frac{d\omega}{d\sigma} \right) = 0, \tag{3.51}$$

or

$$\frac{d\omega}{d\sigma} = -\frac{\partial D_\alpha}{\partial t}. \tag{3.52}$$

Notice that Equations (3.50) and (3.52) are simply Hamilton's equations, with
$D_\alpha(t, \omega)$ as the Hamiltonian function for the ray, as with the scalar case discussed
in Section 2.1. Once these equations have been solved for the ray, we can integrate
along the ray to find the phase:

$$\theta(t') = -\int^{t'} \omega(t)\, dt = -\int^{\sigma_{final}} \omega(\sigma) \frac{dt}{d\sigma}\, d\sigma. \tag{3.53}$$

Here, $\sigma_{final}$ is set so that $t(\sigma_{final}) = t'$.

We have now solved for the phase using the $O(\epsilon^0)$ equation, and the amplitude
using the $O(\epsilon^1)$ equation. There is also slow variation of the polarization, which
comes in at $O(\epsilon^1)$. For this calculation, see Kaufman et al. [8].

If these equations are used to solve the uncoupled oscillator problem, we obtain
the WKB results of Equations (3.33) and (3.37), since for this case the polarization
vectors are constants.
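A minimal ray-tracing sketch for the uncoupled oscillator is given below. It takes the scalar ray Hamiltonian $D(t, \omega) = \omega_1^2(t) - \omega^2$, integrates Hamilton's equations in the form $dt/d\sigma = \partial D/\partial \omega$, $d\omega/d\sigma = -\partial D/\partial t$, and accumulates the phase $\theta = -\int \omega\, (dt/d\sigma)\, d\sigma$ along the ray. The sign convention for the ray equations, the tanh frequency profile, the step size and the function names are assumptions for illustration. With this convention $dt/d\sigma$ is negative on the positive-frequency branch, so the sketch steps $\sigma$ in the negative direction to move forward in time.

```python
import math

W_LO, W_HI, T = 1.0, 2.0, 50.0   # assumed tanh frequency profile

def omega1(t):
    return 0.5 * (W_LO + W_HI) + 0.5 * (W_HI - W_LO) * math.tanh(t / T)

def omega1_dot(t):
    return 0.5 * (W_HI - W_LO) / (T * math.cosh(t / T) ** 2)

def D(t, w):
    """Scalar ray Hamiltonian for the first (uncoupled) oscillator."""
    return omega1(t) ** 2 - w ** 2

def flow(state):
    """Right-hand side of the ray equations plus the phase integral."""
    t, w, _theta = state
    dD_dw = -2.0 * w
    dD_dt = 2.0 * omega1(t) * omega1_dot(t)
    return (dD_dw, -dD_dt, -w * dD_dw)

def rk4_step(state, h):
    def shifted(k, a):
        return tuple(s + a * ki for s, ki in zip(state, k))
    k1 = flow(state)
    k2 = flow(shifted(k1, h / 2))
    k3 = flow(shifted(k2, h / 2))
    k4 = flow(shifted(k3, h))
    return tuple(s + h / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def trace_ray(t0, n_steps, h):
    """Launch a ray on the surface D = 0 and follow it for n_steps."""
    state = (t0, omega1(t0), 0.0)
    for _ in range(n_steps):
        state = rk4_step(state, h)
    return state

t_start = -100.0
t_end, w_end, theta_end = trace_ray(t_start, 7000, -0.01)

# Closed-form phase -integral of omega1 dt for the assumed tanh profile.
theta_exact = -(0.5 * (W_LO + W_HI) * (t_end - t_start)
                + 0.5 * (W_HI - W_LO) * T
                * (math.log(math.cosh(t_end / T)) - math.log(math.cosh(t_start / T))))
```

The ray should remain on the dispersion surface $D = 0$, so that $\omega(\sigma)$ tracks $\omega_1(t(\sigma))$, and the accumulated phase should reproduce the integral of the natural frequency in Equation (3.33).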



3.4.3 WKB with coupling 

If we return to the oscillator problem including the coupling, then an additional
$O(\eta^2)$ term will appear in the phase of the WKB solution. In the coupled case, the
eigenvalues are no longer equal to the diagonal elements of the dispersion matrix
in Equation (3.10). This means that we have to use the instantaneous eigenvalues
and eigenvectors from Equations (3.13) and (3.14) when we construct the WKB
solutions.

Assuming $\omega_1 = \omega_>$, we can use the expansion for the eigenvalues given in
Equation (3.17) to write $\omega_+$ as

$$\omega_+ = \omega_1(t) + \frac{\eta^2}{2\omega_1(t) \left( \omega_1^2(t) - \omega_2^2(t) \right)} + O(\eta^4). \tag{3.54}$$

This expression for $\omega_+$ can be expressed directly as a function of time by linearizing
$\omega_1$ and $\omega_2$ about $t = 0$:

$$\omega_{1,2} \approx \omega_0 + t\, \dot{\omega}_{1,2}(t = 0). \tag{3.55}$$

Putting this into the equation for $\omega_+$ gives an expression where the denominator is
correct to $O(t)$:

$$\omega_+(t) \approx \omega_1(t) + \frac{\eta^2}{4\omega_0^2 (\dot{\omega}_1 - \dot{\omega}_2)\, t} = \omega_1(t) + \frac{\tilde{\eta}^2}{t}. \tag{3.56}$$

This approximation is good away from the mode conversion, but breaks down close
to the mode conversion, where $t \to 0$. See Figure 3.2. Note that the denominator
of the coupling term is the Poisson bracket of the diagonals of $\mathbf{D}$. We will see in
Section 3.6.2 that the bracket arises because of a change of variables in phase space
which is performed in order to make the diagonal elements of $\mathbf{D}$ look like a canonical
pair.
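The range of validity of the linearized expression can be seen by comparing it with the exact upper eigenfrequency. The sketch below assumes natural frequencies that are exactly linear in time, $\omega_{1,2}(t) = \omega_0 + \dot{\omega}_{1,2}\, t$, and evaluates both the exact root of $\det(\mathbf{D}) = 0$ and the approximation $\omega_1(t) + \eta^2 / (4\omega_0^2 (\dot{\omega}_1 - \dot{\omega}_2) t)$ at one time far from the crossing and one time close to it. All numerical values and function names are illustrative assumptions.

```python
import math

# Assumed illustrative parameters: crossing at t = 0, oscillator 1 above for t > 0.
W0, W1_DOT, W2_DOT, ETA = 10.0, 0.1, -0.1, 1.0

def natural(t):
    """Linearized natural frequencies of the two oscillators."""
    return W0 + W1_DOT * t, W0 + W2_DOT * t

def omega_plus_exact(t):
    """Upper eigenfrequency from det D = 0 at time t."""
    w1, w2 = natural(t)
    mean = 0.5 * (w1**2 + w2**2)
    gap = 0.5 * math.sqrt((w1**2 - w2**2) ** 2 + 4.0 * ETA**2)
    return math.sqrt(mean + gap)

def omega_plus_linearized(t):
    """Far-field approximation with a 1/t coupling correction (t > 0)."""
    w1, _ = natural(t)
    return w1 + ETA**2 / (4.0 * W0**2 * (W1_DOT - W2_DOT) * t)

t_far, t_near = 5.0, 0.05
err_far = abs(omega_plus_exact(t_far) - omega_plus_linearized(t_far))
err_near = abs(omega_plus_exact(t_near) - omega_plus_linearized(t_near))
```

Far from the crossing the two expressions agree closely, while near $t = 0$ the $1/t$ term diverges and the exact eigenfrequency instead follows the avoided-crossing hyperbola.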



3.5 Mode Conversion Setup 

We make several assumptions in order to make this problem have the form of
a linear mode conversion problem.

1. $|\eta| \ll |\Delta|$ in regions far from the conversion region, so that incoming and outgoing
waves have an eikonal form.

2. We assume we can ignore the negative frequency solutions during the calculations,
and superimpose them with the positive solutions at the end.

3. The positive frequency solutions of interest are positive for all time.

4. In the time-frequency plane, the natural frequencies asymptote to constant
values, and cross at only one point, $(\omega_0, t_0)$. See Figure 3.3.

5. The timescale for changes in $\omega_i$ is given by $T$, as defined through $\dot{\omega}_i(t) \approx \omega_i / T$.
This $T$ is the same for both oscillators.

6. In order for WKB to be valid, $T \gg T_i = 2\pi/\omega_i$, which implies that $T\omega_i \gg 2\pi$.

FIG. 3.2: The natural frequencies of the oscillators (dashed lines) are different from
the eigenfrequencies of the pair of oscillators (dash-dotted lines), for finite coupling. The
solid lines show the linearized approximation to the coupled frequencies.

FIG. 3.3: A plot of the natural frequency of the oscillators versus time, assuming an
offset hyperbolic tangent form.

We want to find solutions $x_i(t)$ for large time $t \gg t_0$, given the initial conditions
in the form of an eikonal wave

(3.57)

(3.58)

The solutions at large time (long after the mode conversion) should be of the form:

$$x_1 \approx A_1 \exp\left( -i (t - \phi_1)\, \omega_1^{out} \right), \tag{3.59}$$
$$x_2 \approx A_2 \exp\left( -i (t - \phi_2)\, \omega_2^{out} \right). \tag{3.60}$$

The question of finding the solutions for large time now becomes one of somehow
solving for the amplitudes $A_i$ and the phase shifts $\phi_i$. This can either be done by
solving for $x_i(t)$ for all time (as in the numerical solution), or by treating the problem
like a scattering problem, and solving it using matched asymptotics. This approach
uses WKB methods away from the resonance region, and a different asymptotic
approximation near the resonance. The regions where these approximations are
valid will overlap; this is the matching region where the amplitude and phases of
the far-field WKB solutions can be matched onto the local field. This matching can
then be used to construct an S-matrix which connects the incoming WKB wave on
one side of the resonance to the outgoing WKB waves on the other side. We can
then use the S-matrix to directly match WKB solutions, effectively skipping over
the resonance region. Or, since we also have constructed the local solution, it can
be evaluated to find the solution in the vicinity of the mode conversion, giving a
solution $x_i(t)$ which is a valid approximation both far from the resonance and near
it.

3.5.1 Resonance Region 

The asymptotic solution to this problem divides into three components. First, 
standard WKB methods are used to propagate the original solution from some early 
time into the vicinity of the mode conversion region. This region is where the two 
oscillators have very similar natural frequencies, which causes the WKB approxima- 
tion to break down. In this region, a metaplectic transformation is used to convert 
the problem into a simpler form, which can be solved in terms of parabolic cylin- 
der functions. These solutions provide a connection between the incoming WKB 
solution, and the WKB solution on the other side of the mode conversion region. 
The last component of the solution to this problem would be to again use WKB 
methods to find the solution long after the mode conversion. The mathematical 
theories of group representations and the metaplectic transformation are covered in 
the appendix. 

3.6 Local Solution 

We now need to find a way to solve the coupled oscillator Equations (3.9) in
the mode conversion region. Since the natural frequencies of the oscillators become
equal in this region (the dispersion surfaces cross), the WKB approximation is not
valid here. In order to find the solution in this region, we will perform a
metaplectic transformation on the incoming data, and solve the equations in the new
variables. Once through this region, we can invert the transformation, and see that
the outgoing data looks like another WKB solution.

FIG. 3.4: The WKB solution is valid in the far field regions, while a local solution is
necessary in the mode conversion region. These solutions can be matched to each other
in the matching region, where both types of solutions are approximately correct.

The local solution is obtained in several steps:

• Simplify the dispersion matrix by expanding it in a Taylor series about the
mode conversion point, and truncating at linear order. This is simpler if we first
move the origin in phase space to the mode conversion point.

• Simplify the matrix further by performing a change of variables in phase space.
This change of variables is a combination of a scaling transformation and a
canonical transformation. The transformations are chosen such that the diagonal
elements of the matrix form a canonical pair of variables, $(q, p)$. The new dispersion
matrix that we obtain is referred to as the normal form of the dispersion matrix
for the mode conversion problem.

• Convert the dispersion matrix back into an operator in the $q$-representation. This
gives a set of equations which will contain at most first order derivatives, since
the matrix is a linear function of the phase space variables.

• Solve the simplified equations in the $q$-representation. Then convert this local
solution back to the original $t$-representation so that the matching to the WKB
solutions can be performed.

3.6.1 Linearized Dispersion Matrix

In this section the dispersion matrix will be expanded about the mode
conversion point $(t_0, \omega_0)$. In order to simplify this calculation, first shift the origin in
phase space to the mode conversion point. This shift of the origin is performed by
a change of the dependent variables $x_1(t)$ and $x_2(t)$:

(3.61)

(3.62)

This corresponds to the change of variables:

$$t' = t - t_0, \qquad \nu = \omega - \omega_0. \tag{3.63}$$

From this we can see that the shift of the origin on the frequency axis corresponds
to factoring out the carrier $e^{-i\omega_0 t}$ from the solution. The shift of origin on the time
axis is an ordinary shift. In the following we drop the prime from $t'$ to simplify
notation.

The Taylor series for the dispersion matrix about the mode conversion point
$(t_0, \omega_0) = (0, 0)$ is

$$\mathbf{D}(t, \nu) = \mathbf{D}(0, 0) + t \left. \frac{\partial \mathbf{D}}{\partial t} \right|_{0,0} + \nu \left. \frac{\partial \mathbf{D}}{\partial \nu} \right|_{0,0} + \ldots \tag{3.64}$$

Truncating this expansion at linear order gives

$$\mathbf{D}(t, \nu) = \begin{pmatrix} 2\omega_0 (\dot{\omega}_1 t - \nu) & \eta \\ \eta & 2\omega_0 (\dot{\omega}_2 t - \nu) \end{pmatrix}, \tag{3.65}$$

where $\dot{\omega}_i$ is the time derivative of the $i$th natural frequency evaluated at $t = t_0 = 0$.
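To see what is discarded by the linear truncation, the sketch below compares the diagonal entries of the full symbol, $\omega_i^2(t) - \omega^2$ with $\omega = \omega_0 + \nu$, against the linearized entries $2\omega_0(\dot{\omega}_i t - \nu)$ near the mode conversion point. It assumes, for illustration only, that the natural frequencies are exactly linear in time, in which case the neglected terms are exactly $(\dot{\omega}_i t)^2 - \nu^2$. The parameter values and function names are hypothetical.

```python
# Assumed illustrative parameters; the crossing is placed at (t, nu) = (0, 0).
W0, W1_DOT, W2_DOT, ETA = 10.0, 0.5, -0.5, 0.2

def exact_matrix(t, nu):
    """Full symbol with omega = W0 + nu and linear natural frequencies."""
    w = W0 + nu
    w1 = W0 + W1_DOT * t
    w2 = W0 + W2_DOT * t
    return [[w1**2 - w**2, ETA], [ETA, w2**2 - w**2]]

def linearized_matrix(t, nu):
    """Dispersion matrix truncated at linear order in (t, nu)."""
    return [[2.0 * W0 * (W1_DOT * t - nu), ETA],
            [ETA, 2.0 * W0 * (W2_DOT * t - nu)]]

t, nu = 0.01, 0.02
full = exact_matrix(t, nu)
lin = linearized_matrix(t, nu)

# Second-order remainders on the diagonal.
remainder_11 = full[0][0] - lin[0][0]
remainder_22 = full[1][1] - lin[1][1]
```

The remainders are quadratic in the distance from the mode conversion point, so the linearized matrix is a good local model as long as $|\dot{\omega}_i t|$ and $|\nu|$ are small compared with $\omega_0$.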



3.6.2 The Normal Form 

We want the diagonals to look like a canonical pair, so first divide out a factor 
of 2ujq. Then calculate the Poisson bracket of the diagonal terms: 



{uJlt - U, U}2t - Jy} = {uJl - OiJ2){t, -V} =UJ2-iOl. 



(3.66) 



The Poisson bracket is defined as 



{f{t,u),g{t,u)] 



d£dg_ 
dt dv 



d^dg_ 
dv dt ' 



(3.67) 
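We want the bracket of the diagonals to be equal to one when $\dot\omega_1 < 0$. So, make
the definition $B = (\dot\omega_2 - \dot\omega_1) > 0$, and divide out a factor of $B^{1/2}$:

$$\mathbf{D}(t,\nu) = \begin{pmatrix} (\dot\omega_1 t - \nu)/B^{1/2} & \tilde\eta \\ \tilde\eta & (\dot\omega_2 t - \nu)/B^{1/2} \end{pmatrix}, \tag{3.68}$$

where $\tilde\eta = \eta/(2\omega_0 B^{1/2})$. We can now perform a canonical transformation to the
variables $(q,p)$ so that the matrix will be in normal form:

$$\mathbf{D}(q,p) = \begin{pmatrix} -p & \tilde\eta \\ \tilde\eta & q \end{pmatrix}. \tag{3.69}$$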



The operator associated with this normal form is

$$\hat{\mathbf{D}}_{NF} = \begin{pmatrix} -\hat p & \tilde\eta \\ \tilde\eta & \hat q \end{pmatrix}. \tag{3.70}$$



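The linear symplectic (canonical) transformation from $(t,\nu)$ to $(q,p)$ is

$$\begin{pmatrix} q \\ p \end{pmatrix} = \frac{1}{B^{1/2}} \begin{pmatrix} \dot\omega_2 & -1 \\ -\dot\omega_1 & 1 \end{pmatrix} \begin{pmatrix} t \\ \nu \end{pmatrix}. \tag{3.71}$$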

This transformation on the operator induces a transformation of our oscillators. The
vector $\mathbf{x}(t)$ is transformed into a function of $q$ using the integral transform

$$\tilde{\mathbf{x}}(q) = \int e^{iF_1(t,q)}\,\mathbf{x}(t)\,dt. \tag{3.72}$$



Here, $F_1(t,q)$ is the generating function for the canonical transformation in Equation
(3.71). This integral transform is called a metaplectic transformation. In Section
7.6 we will show how this transform arises from the representation theory for the
Heisenberg-Weyl group. The generating function $F_1(t,q)$ has the property that the
conjugate variables can be obtained from it by taking derivatives:

$$\nu = \frac{\partial F_1}{\partial t}, \qquad p = -\frac{\partial F_1}{\partial q}. \tag{3.73}$$

For the transformation given in Equation (3.71), the generating function is

$$F_1(t,q) = \frac{1}{2}q^2 - B^{1/2}\,t\,q + \frac{\dot\omega_2}{2}t^2. \tag{3.74}$$
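
The relation between the linear map (3.71) and the generating function (3.74) can be checked numerically. The sketch below is illustrative only: the function names are ours, the slopes $\dot\omega_1 = -1$, $\dot\omega_2 = +1$ are assumed example values, and the sign convention $\nu = \partial F_1/\partial t$, $p = -\partial F_1/\partial q$ is the one that reproduces (3.71) from (3.74). It confirms that the map has unit determinant and that differentiating $F_1$ recovers the conjugate variables.

```python
import math

# Assumed example slopes of the natural frequencies at the conversion point.
OMEGA1_DOT = -1.0
OMEGA2_DOT = 1.0
B = OMEGA2_DOT - OMEGA1_DOT
SQRT_B = math.sqrt(B)


def to_qp(t, nu):
    """Linear canonical map (t, nu) -> (q, p) of Equation (3.71)."""
    q = (OMEGA2_DOT * t - nu) / SQRT_B
    p = (-OMEGA1_DOT * t + nu) / SQRT_B
    return q, p


def f1(t, q):
    """Generating function of Equation (3.74)."""
    return 0.5 * q * q - SQRT_B * t * q + 0.5 * OMEGA2_DOT * t * t


def conjugates_from_f1(t, q, h=1e-5):
    """Recover (nu, p) from F1 via nu = dF1/dt and p = -dF1/dq."""
    nu = (f1(t + h, q) - f1(t - h, q)) / (2 * h)
    p = -(f1(t, q + h) - f1(t, q - h)) / (2 * h)
    return nu, p


# Determinant of the 2x2 matrix in (3.71), including the 1/sqrt(B) prefactor.
DETERMINANT = (OMEGA2_DOT * 1.0 - (-1.0) * (-OMEGA1_DOT)) / B
```

A unit determinant is what makes the map canonical in a two-dimensional phase space, which is why the factor $B^{-1/2}$ appears in (3.71).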
3.6.3 D as an operator

Now that we have changed variables in phase space, we need to convert the
dispersion matrix (3.69) into an operator acting on functions of $q$. This is achieved
by making the substitution

$$p \to \hat p = i\partial_q, \tag{3.75}$$

since $p$ is the variable conjugate to $q$, just as $\omega$ was conjugate to $t$. Note that this
differs by a sign from the ordinary correspondence. This is because $q$ is playing the
role of time, not of a spatial coordinate. The sign convention for plane waves in
time is opposite that of plane waves in space, so the correspondence
between $p$ and the derivative with respect to $q$ also differs by a sign from what
we expect. Performing this substitution gives us the equation

$$\begin{pmatrix} -i\partial_q & \tilde\eta \\ \tilde\eta & q \end{pmatrix} \mathbf{x}(q) = 0. \tag{3.76}$$



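Another way to check that the substitution in Equation (3.75) is the one we
want is to apply the metaplectic transformation to the $D_{11}$ element of the dispersion
matrix in the $t$ representation. We can use this to define the operator $\hat p$:

\begin{align}
-\hat p\,\tilde f(q) &\equiv \int e^{iF_1(t,q)}\,\bigl(\hat D_{11} f(t)\bigr)\,dt \tag{3.77}\\
&= \frac{1}{B^{1/2}}\int e^{iF_1(t,q)}\,(\dot\omega_1 t - i\partial_t)\,f(t)\,dt. \tag{3.78}
\end{align}

Use integration by parts to act with the derivative on the phase instead of on $f(t)$.
This changes the sign of the derivative term:

\begin{align}
-\hat p\,\tilde f(q) &= \frac{1}{B^{1/2}}\int \left[(\dot\omega_1 t + i\partial_t)\,e^{iF_1(t,q)}\right] f(t)\,dt \tag{3.79}\\
&= \frac{1}{B^{1/2}}\int \left(\dot\omega_1 t - \frac{\partial F_1}{\partial t}\right) e^{iF_1(t,q)} f(t)\,dt \tag{3.80}\\
&= \frac{1}{B^{1/2}}\int \left(\dot\omega_1 t + B^{1/2} q - \dot\omega_2 t\right) e^{iF_1(t,q)} f(t)\,dt \tag{3.81}\\
&= \int \left(-B^{1/2} t + q\right) e^{iF_1(t,q)} f(t)\,dt \tag{3.82}\\
&= \int \frac{\partial F_1}{\partial q}\, e^{iF_1(t,q)} f(t)\,dt \tag{3.83}\\
&= \int \left(-i\partial_q e^{iF_1(t,q)}\right) f(t)\,dt \tag{3.84}\\
&= -i\partial_q \int e^{iF_1(t,q)} f(t)\,dt \tag{3.85}\\
&= -i\partial_q \tilde f(q). \tag{3.86}
\end{align}

This establishes the relationship $\hat p = i\partial_q$.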

3.6.4 Local solution in the q representation 

We now want to solve Equation (3.76). The second row is an algebraic equation
which gives us

$$x_2(q) = -\frac{\tilde\eta}{q}\,x_1(q). \tag{3.87}$$

Substituting this into the first row gives us a first order differential equation for
$x_1(q)$:

$$\partial_q x_1(q) = \frac{i\tilde\eta^2}{q}\,x_1(q). \tag{3.88}$$

This has the form of a logarithmic derivative, and can be integrated to give

$$x_1(q) = e^{i\tilde\eta^2\,\mathrm{Ln}(q)} = e^{i\tilde\eta^2(\ln|q| + i\arg(q))}. \tag{3.89}$$

The choice of branch cut for $\arg(q)$ will determine the solution we obtain. For $q < 0$
we have two choices, $i\arg(q) = \pm i\pi$. This means that there are two options for the
amplitude of the solution when $q < 0$, $|x_1| = e^{\mp\pi\tilde\eta^2}$. In either case, when $q > 0$, the
magnitude is $|x_1| = 1$. With these two options for the magnitude, we can define
two solutions for $x_1(q)$, both of which are valid solutions to Equation (3.88). These
are


xtiq) 



and 



x\{q) 



ir) 



if g < 
if g > 

if g< 
if g> 



(3.90) 



(3.91) 



where $\tau$ is called the transmission coefficient,

$$\tau = e^{-\pi\tilde\eta^2} < 1. \tag{3.92}$$



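If the initial conditions are such that the energy starts out in the first oscillator, then
the appropriate solution is the one with unit amplitude for $q < 0$ and amplitude $\tau$
for $q > 0$. The factor of $\tau$ multiplying the solution for $q > 0$ represents the energy
lost to the second oscillator in the mode conversion.
The solution for the second oscillator is then

$$x_2(q) = \begin{cases} \tilde\eta\,|q|^{i\tilde\eta^2 - 1} & \text{if } q < 0, \\ -\tau\,\tilde\eta\,|q|^{i\tilde\eta^2 - 1} & \text{if } q > 0. \end{cases} \tag{3.93}$$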
Because of the negative power of $q$, this solution is localized near $q = 0$, and its
amplitude diverges as $q \to 0$. We can combine these solutions into a vector:

$$\begin{pmatrix} x_1(q) \\ x_2(q) \end{pmatrix} = A(q)\,|q|^{i\tilde\eta^2} \begin{pmatrix} 1 \\ -\tilde\eta/q \end{pmatrix}, \tag{3.94}$$

where $A(q) = 1$ for $q < 0$ and $A(q) = \tau$ for $q > 0$.
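
The local solution can be checked numerically. The sketch below is a minimal illustration under stated assumptions: the coupling value `ETA` is hypothetical, the function names are ours, and the first row of (3.76) is taken to be $-i\partial_q x_1 + \tilde\eta x_2 = 0$, following the substitution (3.75). It verifies that the piecewise solution (3.94) satisfies the first row on both sides of $q = 0$, and that continuing $q^{i\tilde\eta^2}$ to $q = -1$ with $\arg(q) = \pi$ produces the factor $\tau$ of (3.92).

```python
import cmath
import math

ETA = 0.4  # hypothetical value of the normalized coupling (eta-tilde)
TAU = math.exp(-math.pi * ETA ** 2)  # transmission coefficient, Eq. (3.92)


def x1(q):
    """Upper component of Eq. (3.94): A(q) * |q|^(i eta^2)."""
    amplitude = 1.0 if q < 0 else TAU
    return amplitude * cmath.exp(1j * ETA ** 2 * math.log(abs(q)))


def x2(q):
    """Lower component from the algebraic row, Eq. (3.87)."""
    return -(ETA / q) * x1(q)


def first_row_residual(q, h=1e-6):
    """Residual of -i d/dq x1 + eta * x2, which should vanish away from q = 0."""
    dx1 = (x1(q + h) - x1(q - h)) / (2 * h)
    return -1j * dx1 + ETA * x2(q)


# Value of q^(i eta^2) at q = -1 on the branch with arg(q) = +pi.
branch_factor = cmath.exp(1j * ETA ** 2 * cmath.log(complex(-1.0, 0.0)))
```

The residual is only evaluated away from the origin, since the finite-difference stencil must not straddle the jump in $A(q)$ at $q = 0$.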



3.6.5 Transform back to the t representation

The local solution that has been obtained in the $q$ representation can now be
converted back to the $t$ representation to compare with the WKB solutions. This is
achieved by an inverse metaplectic transformation.



Xi{t) 



1 



(3.95) 



This is an integral form of the parabolic cylinder functions, which are known to
be the correct local form of the solution in the mode conversion region in the $t$
representation. However, in order to match this local solution to the WKB modes far
from $t = 0$, we only need the asymptotic form of this expression for large $t$. Because
of the singular nature of $x_2(q)$, this metaplectic integral is difficult to evaluate. In
[9], Tracy et al. use the stationary phase approximation to obtain the asymptotics of
this integral. Brizard et al. describe in [20] how to compute the Fourier transform of
$x_2(q)$, which gives the $p$ representation of the lower channel. In the $p$ representation,
the singularity at the origin no longer appears, and the integral becomes easier to
compute.

An alternative approach is to solve for $x_2(t)$ directly in the $t$ representation. The
metaplectic transformation of $x_1(q)$ gives $x_1(t)$ in terms of the parabolic cylinder
function. Inserting this into the $t$ representation of the equation for $x_2(t)$, and using
the recurrence relations for the parabolic cylinder function, allows us to evaluate the
derivative:

X2{t) = ^{t-idt)xi{t). (3.96) 

This representation of the solution can be easily compared to numerical simulations
of the original system of equations, since methods for calculating the parabolic
cylinder functions are readily available. The particular form of parabolic cylinder
functions which are obtained for $x_1(t)$ and $x_2(t)$ will depend on the parameters $B$,
$\dot\omega_j$, and $\eta$. For a plot of a specific example see Figure 3.6.
3.6.6 The Transmission and Conversion Coefficients

At this point, the transmission and conversion coefficients can be obtained from
the solutions in the $q$ and $p$ representations. Comparing the incoming (negative $q$) and
outgoing (positive $q$) values of $x_1$ gives the transmission coefficient $\tau$:

xi{q) 



(3.97) 



Because of the choice of branch cut that is needed to deal with the singularity at
$q = 0$, we take $-1 = e^{-i\pi}$ in the above formula. Calculation of the conversion
coefficient $\beta$ is more difficult because of the singular nature of $x_2(q)$ at the origin,
$q = 0$. However, the amplitude of $\beta$ can be calculated by action conservation:

$$|\beta|^2 = 1 - |\tau|^2 = 1 - e^{-2\pi\tilde\eta^2}. \tag{3.98}$$



Getting the phase of $\beta$ correct is a more difficult calculation, which is performed in
[6]. In that paper, $\beta$ is found to be given by

$$\beta = \frac{\sqrt{2\pi\tau}}{\tilde\eta\,\Gamma(-i|\tilde\eta|^2)}. \tag{3.99}$$

With these coefficients, we can write the scattering matrix as



X 



™OUt 




(3.100) 



where $x^{\rm in}$ and $x^{\rm out}$ are the complex amplitudes of the WKB solutions to the left and
right of the mode conversion. This scattering matrix allows us to find the amplitude
and phase for rays leaving the mode conversion, given the amplitude and phase on
the incoming rays.
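
The consistency of Equations (3.92), (3.98) and (3.99) can be checked numerically: the magnitude of $\beta$ from the Gamma-function formula should satisfy $|\beta|^2 + \tau^2 = 1$. The sketch below is illustrative only. The coupling value passed in is hypothetical, the function names are ours, and because the Python standard library has no complex Gamma function we substitute the well-known Lanczos approximation (g = 7, nine coefficients) with the reflection formula; this is a convenience of the example and not part of the original calculation.

```python
import cmath
import math

# Standard Lanczos coefficients for g = 7, n = 9.
_LANCZOS = [
    0.99999999999980993, 676.5203681218851, -1259.1392167224028,
    771.32342877765313, -176.61502916214059, 12.507343278686905,
    -0.13857109526572012, 9.9843695780195716e-6, 1.5056327351493116e-7,
]


def gamma(z):
    """Complex Gamma function via the Lanczos approximation."""
    z = complex(z)
    if z.real < 0.5:
        # Reflection formula: Gamma(z) * Gamma(1 - z) = pi / sin(pi z).
        return cmath.pi / (cmath.sin(cmath.pi * z) * gamma(1 - z))
    z -= 1
    x = _LANCZOS[0]
    for i in range(1, len(_LANCZOS)):
        x += _LANCZOS[i] / (z + i)
    t = z + 7.5
    return cmath.sqrt(2 * cmath.pi) * t ** (z + 0.5) * cmath.exp(-t) * x


def scattering_coefficients(eta):
    """Return (tau, beta) for a normalized coupling eta, per (3.92) and (3.99)."""
    eta2 = abs(eta) ** 2
    tau = math.exp(-math.pi * eta2)
    beta = cmath.sqrt(2 * math.pi * tau) / (eta * gamma(-1j * eta2))
    return tau, beta


tau, beta = scattering_coefficients(0.4)  # 0.4 is a hypothetical coupling
```

The identity holds because $|\Gamma(-iy)|^2 = \pi/(y\sinh \pi y)$ for real $y$, so the check exercises both the magnitude formula and the numerical Gamma routine.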
FIG. 3.5: Numerical solution showing mode conversion between the two modes. Simulation
parameters: $\omega_1(t) = 20 - 5\tanh(t/5)$, $\omega_2(t) = 20 + 5\tanh(t/5)$, $\mathbf{x}_0 = (1,0)$,
$\dot{\mathbf{x}}_0 = (0,0)$.

3.7 Numerical Solution and Comparison with Theory

The equations of motion in Equations (3.6) and (3.7), together with the appropriate
initial conditions, can be solved numerically to find the output amplitudes
$A_i^{\rm out}$ and phase shifts $\phi_i$. The ODE solver in MATLAB was used to perform this
calculation. The variation in the natural frequencies was taken to have a hyperbolic
tangent form, as shown in Figure 3.3. The numerical solution is shown in Figure 3.5.
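
A calculation of this kind can be sketched in a few lines. The code below is an illustration under loudly stated assumptions and is not the MATLAB calculation used for the figures: the equations of motion are assumed to have the form $\ddot x_1 + \omega_1^2(t)x_1 + \eta x_2 = 0$, $\ddot x_2 + \omega_2^2(t)x_2 + \eta x_1 = 0$ (chosen to be consistent with the linearized dispersion matrix (3.65), since Equations (3.6) and (3.7) are not reproduced in this section), the coupling value is hypothetical, and a fixed-step fourth order Runge-Kutta integrator stands in for the ODE solver. The frequency profiles and initial conditions are those quoted in the caption of Figure 3.5.

```python
import math


def omega1(t):
    """Natural frequency of oscillator 1 (profile from the Fig. 3.5 caption)."""
    return 20.0 - 5.0 * math.tanh(t / 5.0)


def omega2(t):
    """Natural frequency of oscillator 2 (profile from the Fig. 3.5 caption)."""
    return 20.0 + 5.0 * math.tanh(t / 5.0)


def rhs(t, state, eta):
    """First order form of the ASSUMED coupled oscillator equations."""
    x1, x2, v1, v2 = state
    a1 = -omega1(t) ** 2 * x1 - eta * x2
    a2 = -omega2(t) ** 2 * x2 - eta * x1
    return (v1, v2, a1, a2)


def rk4_step(t, state, dt, eta):
    """One classical fourth order Runge-Kutta step."""
    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))

    k1 = rhs(t, state, eta)
    k2 = rhs(t + dt / 2, shifted(state, k1, dt / 2), eta)
    k3 = rhs(t + dt / 2, shifted(state, k2, dt / 2), eta)
    k4 = rhs(t + dt, shifted(state, k3, dt), eta)
    return tuple(
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def simulate(eta, t_start=-20.0, t_end=20.0, dt=0.005):
    """Integrate from x = (1, 0), xdot = (0, 0); return final state and peak |x2|."""
    steps = int(round((t_end - t_start) / dt))
    state = (1.0, 0.0, 0.0, 0.0)
    peak_x2 = 0.0
    for i in range(steps):
        state = rk4_step(t_start + i * dt, state, dt, eta)
        peak_x2 = max(peak_x2, abs(state[1]))
    return state, peak_x2
```

For example, `simulate(40.0)` (a hypothetical coupling) integrates through the crossing at $t = 0$, where both frequencies equal 20; with zero coupling the second oscillator is never excited, which gives a simple sanity check of the integrator.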

A more detailed comparison of the local solutions to the WKB solutions was also
performed, using the linearized dispersion matrix in Equation (3.69) as a starting
point. The local solution, which is given by parabolic cylinder functions in the $t$
representation, is an exact solution to the wave problem in this normal form. We then
added quadratic terms (such as $q^2$) to the normal form dispersion matrix, with small
coefficients. This causes the numerical solution to deviate from the parabolic
cylinder form. Figure 3.6 shows the results of one such calculation. Far from the
mode conversion, the WKB solution can correctly capture the effects of the quadratic
terms, while near the mode conversion, the parabolic cylinder functions model the
jumps in amplitude and phase. The effects of these jumps are taken into account in
the WKB solution shown in Figure 3.6 by using the scattering coefficients $\tau$ and
$\beta$ to modify the amplitude and phase at the origin. The phase of
the second oscillator in our simulations was off from the expected phase by $\pi/4$,
for reasons we have not fully identified. We
suspect that this could be due to the phase convention chosen for the metaplectic
transformation in Equation (3.95). Other authors [20] include an additional factor in
the normalization of the metaplectic integral, which could be what we are seeing
in these simulations. Additionally, there is the possibility that this phase error is
due to the fact that this calculation with oscillators uses functions of time, rather
than functions of space as is the case for wave problems. This leads to the opposite
convention for the correspondence between derivatives and phase space variables, as
shown in Equation (3.75) and the subsequent discussion.



3.8 Summary 

In this chapter we described how phase space techniques can be used to solve
multicomponent wave problems, including problems which exhibit mode conversion.
We gave an example of a pair of coupled oscillators which undergo resonant conversion,
and showed how the phase space techniques can be applied to this problem.
The key idea is that the symbol of the wave operator (the dispersion matrix) is a
smooth function on phase space. We can then apply canonical transformations on
phase space to simplify the form of the dispersion function. Also, since the dispersion
function is smooth, we can expand it about a point in phase space. For the
mode conversion problem, this local expansion gives an approximate wave equation,
which can be solved to find the structure of the solution in the mode conversion
region. This local solution can then be matched onto WKB solutions, which are
good approximate solutions far from the mode conversion. This allows the mode
conversion to be treated as a scattering problem, where the S-matrix connects the
WKB solutions on opposite sides of the conversion.

FIG. 3.6: Absolute value and relative phase of $x_1$ and $x_2$, from numerical solution (red),
superimposed on the WKB values (dashed curves). The exact parabolic cylinder functions
(black) are used to set the amplitude and phase of the outgoing WKB solution.
The amplitude and phase of the local solution is matched to the incoming WKB solution
at $t = -3$. A phase mismatch of $\pi/4$ has been arbitrarily removed from $x_2$, in order
to better show the comparison between analytical and numerical solutions (see text for
more details). The simulation parameters for this figure are the same as in Figure 4.4.

In this chapter we also report numerical simulations, which show the matching 
between the local solution, the WKB solutions, and direct numerical simulation 
of the equations. The matching shown in Figure 3.6 is fairly good, giving the
amplitude and phase jumps across the conversion. However, higher order terms in 
the dispersion matrix have been neglected in the local analysis, which only used a 
linear approximation for the dispersion matrix. The neglected terms cause the local 
solution to diverge from the WKB solution fairly rapidly. In the next chapter, we 
will reintroduce the quadratic terms into the dispersion matrix, and derive a new 
local solution. We will show that keeping these terms leads to much better matching 
between the corrected local solution and the far-field WKB solutions. 



CHAPTER 4 



Higher Order Corrections 



4.1 Introduction and Motivation 

In previous work [6], [7], [9] it was shown that phase space ray-tracing techniques
can be used to solve wave problems exhibiting mode conversion. Such a problem can
be written in matrix form. We consider solutions at a given frequency, and write
the two equations for the coupled wave modes together:

$$\hat{\mathbf{D}}(q, -i\partial_q, \omega)\cdot\boldsymbol{\psi}(q) = 0. \tag{4.1}$$

While many problems of interest concern waves in multiple spatial dimensions, in
this chapter we limit our analysis to the case where $q$ is one-dimensional. Additionally,
we will suppress the frequency dependence for brevity of notation. Using
the symbol calculus, we can define the symbol of the wave operator as a
matrix valued function on wave phase space, $\mathbf{D}(q,p)$. Here, the variable $p$ corresponds
to the operator $-i\partial_q$, and products of operators are symmetrized (e.g.,
$qp \to -i(q\partial_q + \partial_q q)/2$). In the vicinity of a mode conversion, there are two roots
of the dispersion relation $\det(\mathbf{D}(q,p)) = 0$. These two curves in phase space locally
have a hyperbolic structure (an "avoided crossing", see Figure 4.1). Linearizing the
$q,p$-dependence of the dispersion matrix about the center of the hyperbola, and
converting this linearized symbol back to an operator, gives a set of coupled equations
which can be solved for the local wave fields. Matching these local solutions
onto uncoupled WKB solutions (which are a good approximation to the solutions
far from the mode conversion region) gives transmission and conversion coefficients
for the incoming and outgoing waves. These coefficients can be used to treat the
mode conversion as a ray-splitting process, where amplitude on the incoming ray is
split onto the two types of outgoing rays.




FIG. 4.1: The phase space structure of a typical "avoided crossing" mode conversion. The
hyperbolic dispersion curves are the solid lines, and the dashed lines are the dispersion
curves for the uncoupled modes.

This ray-splitting approach captures the jump in amplitude caused by the coupling
between the two modes at linear order in phase space variables. However,
higher order terms in the wave equation can lead to additional effects. For example,
the amplitude variation familiar from WKB theory is not captured by the linear
solution. This could cause difficulties when attempting to match the local wave
fields onto the incoming and outgoing WKB solutions. Figure 3.6 illustrates this
effect in the example of two coupled oscillators. In this example, the local solution
captures the jump in amplitude at the mode conversion, but misses the WKB
amplitude variation. This limits the matching region to a small range right near
the mode conversion, which could make numerical ray-tracing algorithms somewhat
unstable. However, this example also suggests that there may be a simple correction
to the linearized solution which will also capture the WKB amplitude variation.

In this chapter we consider the effect of adding generic quadratic terms to the 
wave equation. These terms will modify the uncoupled dispersion relations, which 
will in turn modify the far-field WKB solutions for the incoming and outgoing waves. 
When the new quadratic terms are added to the coupled equations, they will give 
us a new local solution for the wave fields. This local solution will contain both 
non-propagating "near-field" contributions and phase modifications to the original, 
linear order, local solution. The near-field terms do not propagate, and therefore do 
not modify the transmission and conversion coefficients. The phase modifications, 
however, lead to phase corrections in the scattering coefficients at order $\epsilon|\eta|^2$. Lastly,
we give a comparison with numerical solutions for a simple example. 

4.2 Extension to Higher Order 

In general, the Taylor's series expansion of the dispersion matrix in Equation
(3.64) will contain terms of higher order than the linear terms kept in the approximation
of the previous section. These terms will have an effect on both the far-field
WKB solutions, and on the local solution. If we can calculate these effects, then
we can obtain a better match of the local solution to the incoming and outgoing
WKB solutions. This will allow us to calculate any corrections that there may be to
the scattering coefficients.
4.2.1 The Normal Form in One Spatial Dimension

As described in Section 3.6.2 and shown in 2, la], the $2\times 2$ symbol of the
dispersion matrix can be put into the following "normal form" at linear order:

$$\mathbf{D}_{NF}(q,p) = \begin{pmatrix} -p & \eta \\ \eta^* & q \end{pmatrix}. \tag{4.2}$$

Here, $\eta$ is a constant since we are working in a two-dimensional phase space. The
higher order corrections to this matrix appear at quadratic order in the phase space
variables:

$$\mathbf{D}(q,p) = \mathbf{D}_{NF}(q,p) + \mathbf{D}_2(q,p) + O(z^3). \tag{4.3}$$

Each element of $\mathbf{D}_2$ can contain terms which are quadratic in the phase space variables
$z = (q,p)$.

In this section, we will argue that it should be possible to perform a phase space 
dependent change of basis which puts the dispersion matrix into the form 

D'(2) = Qt(2) ■ D(2) ■ Q(^) = Dnf(2) + ' 



V 



rfi'V^ 



+ 0{z^). (4.4) 



This would imply that, in general, we only need to consider the effect of additional 
quadratic terms on the diagonals of D, but not on the off-diagonals. 

We are interested in removing the second order terms from the off-diagonals of 
D. We can assume that D is a hermitian matrix, so we will write the off-diagonal 
terms of D2 as 



D2(g,p) 





di 


+ 


/ 




qp + 


/ 






) 






) 




^ 4 


/ 



(4.5) 



We will use a near-identity change of polarization basis, so we can write $\mathbf{Q}$ as

$$\mathbf{Q}(q,p) = \mathbf{1} + \mathbf{a}\,p + \mathbf{b}\,q. \tag{4.6}$$

Some algebra shows that the matrices $\mathbf{a}$ and $\mathbf{b}$ should have the following form in
order to remove the second order terms from the off-diagonals:



di 



a 



\ 



J 



V 



-d% -e-^V^3(a* + rf2) ) 

Here, $\phi$ is the phase of the coupling, $\eta = |\eta|e^{i\phi}$, and $\alpha$ is given by



(4.7) 



(4.8) 



a 



' 2 ~ 2 



±i;^/dl-M,d3. 



(4.9) 



This transformation will make the off-diagonals in $\mathbf{D}'$ constants plus terms starting
at $O(z^3)$.

In order to obtain the form of $\mathbf{D}$ given in equation (4.4), we need to correct
the first order terms in the diagonals, since the change of basis will mix $q$ and $p$.
A canonical transformation of phase space will recover the normal form at linear
order:

$$\begin{pmatrix} q' \\ p' \end{pmatrix} = \frac{1}{|B|^{1/2}} \begin{pmatrix} 1 - 2\Re(\alpha\eta^*) & 2\Re(d_1\eta^*) \\ 2\Re(d_3\eta^*) & 1 + 2\Re(\alpha\eta^*) \end{pmatrix} \begin{pmatrix} q \\ p \end{pmatrix}. \tag{4.10}$$

Here the normalization is introduced to make $(q',p')$ a canonical pair. It is given by
the Poisson bracket of the transformed diagonal elements:

$$B = \{D'_{11}, D'_{22}\} = \left(1 - 4(\Re(\alpha\eta^*))^2 - 4\Re(d_1\eta^*)\,\Re(d_3\eta^*)\right)\{q,p\} = \{q,p\}\,(1 + O(\epsilon^2)). \tag{4.11}$$



4.2.2 Moyal Corrections for Phase Space Dependent Changes 
of Polarization 

In the previous section, a phase space dependent change of polarization is used
to put the dispersion matrix into the form where its off-diagonal elements are constants
with a small perturbation that starts at order $z^3$. This transformation is
achieved through the conjugation

$$\mathbf{D}'(z) = \mathbf{Q}^\dagger(z)\cdot\mathbf{D}(z)\cdot\mathbf{Q}(z). \tag{4.12}$$

However, since the matrices $\mathbf{D}$ and $\mathbf{Q}$ are actually matrix valued symbols of operators,
we really need to use the Moyal star product to multiply the elements of the
matrix. The noncommutative star product is used in the symbol calculus so that
the symbols (functions on phase space) maintain the commutation relations of the
original operators. So, we should actually use the expression

$$\mathbf{D}'(z) = \mathbf{Q}^\dagger(z)\star\mathbf{D}(z)\star\mathbf{Q}(z), \tag{4.13}$$

where the matrices are multiplied in the usual way, but the elements of the matrices
are multiplied using the star product. In this section we will argue that the effect
of the star product is to introduce additional terms into the expression for $\mathbf{D}'(z)$
which are of higher order than those that we are considering in this dissertation,
and therefore can be neglected.

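The Moyal star product of two symbols $A(z)$ and $B(z)$ is often written as a
formal power series (see [1] with $\hbar$ set to 1 since we are studying classical fields). In
general, this series can contain infinitely many terms:

\begin{align}
A(z)\star B(z) &= A(z)\exp\left(\frac{i}{2}\,\overleftarrow{\frac{\partial}{\partial z_\alpha}}\,J_{\alpha\beta}\,\overrightarrow{\frac{\partial}{\partial z_\beta}}\right)B(z) \tag{4.14}\\
&= \sum_{n=0}^{\infty}\frac{1}{n!}\,A(z)\left(\frac{i}{2}\,\overleftarrow{\frac{\partial}{\partial z_\alpha}}\,J_{\alpha\beta}\,\overrightarrow{\frac{\partial}{\partial z_\beta}}\right)^{n}B(z) \tag{4.15}\\
&= A(z)B(z) + \frac{i}{2}\{A,B\} - \frac{1}{8}\left(\partial_q^2 A\,\partial_p^2 B - 2\,\partial_q\partial_p A\,\partial_q\partial_p B + \partial_p^2 A\,\partial_q^2 B\right) + \ldots \tag{4.16}
\end{align}

Here, $J_{\alpha\beta}$ is the symplectic matrix. In our problem, we have an implicit small
parameter because of the asymptotic nature of the calculation. This allows us to
truncate this series at some order to obtain an approximation for the star product.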
We are assuming that the symbol of the dispersion matrix has a well defined
Taylor's series in the mode conversion region, and that the lowest order terms in
the series dominate. This can be expressed mathematically by introducing a small
parameter $\epsilon$. Using the multi-index notation, we can expand the symbol in terms of
all possible monomials:

$$A(z) = \sum_{|m|\ge 0} a_m(\epsilon z)^m = \sum_{M=0}^{\infty}\epsilon^M\sum_{|m|=M} a_m z^m. \tag{4.17}$$

The ordinary product of two symbols is given by

\begin{align}
A(z)B(z) &= \left(\sum_{M=0}^{\infty}\epsilon^M\sum_{|m|=M} a_m z^m\right)\cdot\left(\sum_{N=0}^{\infty}\epsilon^N\sum_{|n|=N} b_n z^n\right) \tag{4.18}\\
&= \sum_{N,M=0}^{\infty}\epsilon^{M+N}\sum_{|m|=M}\sum_{|n|=N} a_m b_n\,z^{m+n} \tag{4.19}\\
&= \sum_{|l|\ge 0} c_l\,(\epsilon z)^l. \tag{4.20}
\end{align}

The star product is given by a similar formula, where $\cdot$ is replaced by $\star$:

$$A(z)\star B(z) = \sum_{N,M=0}^{\infty}\epsilon^{M+N}\sum_{|m|=M}\sum_{|n|=N} a_m b_n\,z^m\star z^n. \tag{4.21}$$

So we need to consider the star product of two generic monomials, $z^m\star z^n$. Because
the star product can be written as a series in powers of derivatives, we can express
the product as a polynomial of degree $L = |m| + |n|$. Although all of the coefficients
of this polynomial can be calculated from equation (4.14), we compute only the
highest power:

L-l 

2- * 2" = 2-+" + J] (4.22) 
\i\=o 

This equation is significant for us because it means that the star product only intro- 
duces additional monomials of degree less than the degree of the ordinary product. 



We can use this equation to write 



L-l 



A{z)*B{z) = A{z)B{z)+ E E^^^-E^'^' 

Af,Af=0 \m\=M\n\=N \l\=Q 

oo L— 1 

L=0 \l\=Q 

(oo L-l 
L=0 \l\=0 
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(4.23) 
(4.24) 

(4.25) 



This means that the corrections due to the star product are at least one order higher
in $\epsilon$ than the regular commutative product, since the coefficients of $(\epsilon z)^l$ in
the series for the corrections are at least $O(\epsilon)$ while the coefficients in the series for
$A\cdot B$ are $O(1)$.
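
The statement that the star product of two monomials equals their ordinary product plus monomials of strictly lower degree can be illustrated with a small polynomial implementation. The sketch below is ours and makes several assumptions: polynomials in $(q,p)$ are stored as dictionaries mapping exponent pairs `(m, n)` to the coefficient of $q^m p^n$, $\hbar = 1$, and the sign convention is the one in which the first correction is $+\frac{i}{2}\{A,B\}$ with $\{q,p\} = 1$. Since the inputs are polynomials, the series terminates and no truncation is needed.

```python
from math import comb, factorial


def deriv(poly, dq, dp):
    """Apply (d/dq)^dq (d/dp)^dp to a polynomial {(m, n): coeff}."""
    out = {}
    for (m, n), c in poly.items():
        if m >= dq and n >= dp:
            factor = (factorial(m) // factorial(m - dq)) * (
                factorial(n) // factorial(n - dp)
            )
            key = (m - dq, n - dp)
            out[key] = out.get(key, 0) + c * factor
    return out


def multiply(a, b):
    """Ordinary (commutative) product of two polynomials."""
    out = {}
    for (m1, n1), c1 in a.items():
        for (m2, n2), c2 in b.items():
            key = (m1 + m2, n1 + n2)
            out[key] = out.get(key, 0) + c1 * c2
    return out


def degree(poly):
    """Total degree of a non-empty polynomial."""
    return max(m + n for (m, n) in poly)


def star(a, b):
    """Moyal star product of two polynomials (hbar = 1, assumed sign convention)."""
    result = {}
    # Order-n terms need n derivatives on each factor, so the sum terminates.
    for order in range(min(degree(a), degree(b)) + 1):
        prefactor = (0.5j) ** order / factorial(order)
        for k in range(order + 1):
            # Binomial expansion of (d_q^L d_p^R - d_p^L d_q^R)^order.
            term = multiply(deriv(a, order - k, k), deriv(b, k, order - k))
            weight = prefactor * comb(order, k) * (-1) ** k
            for key, c in term.items():
                result[key] = result.get(key, 0) + weight * c
    return {key: c for key, c in result.items() if abs(c) > 1e-14}


Q = {(1, 0): 1}
P = {(0, 1): 1}
q_star_p = star(Q, P)                    # q*p + i/2
p_star_q = star(P, Q)                    # q*p - i/2
q2_star_p2 = star({(2, 0): 1}, {(0, 2): 1})  # q^2 p^2 + 2i q p - 1/2
```

In each case the only term of full degree is the ordinary product, and the difference `q_star_p - p_star_q` reproduces the commutator $i$ expected from the operator correspondence.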



4.2.3 Quadratic Order Terms 



As shown in Section 4.2.1, the quadratic approximation to the dispersion
matrix can be put into the form of equation (4.4), which, with the introduction of
the small parameter $\epsilon$ in the quadratic terms, is

$$\mathbf{D}(q,p) = \begin{pmatrix} -p + \epsilon(a_1p^2 + b_1pq + c_1q^2) & \eta \\ \eta^* & q + \epsilon(c_2p^2 + b_2pq + a_2q^2) \end{pmatrix}. \tag{4.26}$$



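This second order matrix valued function on phase space can now be converted into
a pair of coupled second order differential equations for the two modes:

$$\begin{pmatrix} \hat D_{11} & \eta \\ \eta^* & \hat D_{22} \end{pmatrix}\begin{pmatrix} \psi_1(q) \\ \psi_2(q) \end{pmatrix} = 0, \tag{4.27}$$

where the operators on the diagonals are

$$\hat D_{11} = i\partial_q + \epsilon\left(a_1(-i\partial_q)^2 - \frac{ib_1}{2}(\partial_q q + q\partial_q) + c_1q^2\right) \tag{4.28}$$

and

$$\hat D_{22} = q + \epsilon\left(c_2(-i\partial_q)^2 - \frac{ib_2}{2}(\partial_q q + q\partial_q) + a_2q^2\right). \tag{4.29}$$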



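Because of the form of these equations, it will be convenient to analyze the lower
channel in Fourier space. The Fourier transform of these equations gives the p-representation
of the equations:

$$\begin{pmatrix} \hat D'_{11} & \eta \\ \eta^* & \hat D'_{22} \end{pmatrix}\begin{pmatrix} \psi_1(p) \\ \psi_2(p) \end{pmatrix} = 0, \tag{4.30}$$

which has diagonal elements

$$\hat D'_{11} = -p + \epsilon\left(a_1p^2 + \frac{ib_1}{2}(p\partial_p + \partial_p p) + c_1(i\partial_p)^2\right) \tag{4.31}$$

and

$$\hat D'_{22} = i\partial_p + \epsilon\left(a_2(i\partial_p)^2 + \frac{ib_2}{2}(p\partial_p + \partial_p p) + c_2p^2\right). \tag{4.32}$$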



We proceed by first finding the new, uncoupled, far-field WKB solutions, and
then finding the coupled local solution. Finally, we examine the local solution and
match it onto the WKB solutions, which gives us the order $\epsilon$ corrections to the
scattering coefficients.



4.2.4 Uncoupled WKB Modes 



Because of the quadratic terms in the dispersion matrix [Equation (4.26)], the
uncoupled WKB solutions will contain modifications of order $\epsilon$. These corrections
can be found by solving Equation (4.27) [or, if we use the p-representation, Equation
(4.30)] with $\eta = 0$.

First, consider the second order derivatives, $-\epsilon a_1\partial_q^2$ and $-\epsilon c_2\partial_q^2$. These terms
introduce new solutions which have the form $e^{iq/(a_1\epsilon)}$ and $e^{iq/(c_2\epsilon)}$. The dispersion surfaces
for these solutions are far ($p \sim 1/(a_1\epsilon)$) from the solution that we are interested
in ($p \sim 0$). The limit $\epsilon \to 0$ is a singular limit, since these new solutions cease to
exist for $\epsilon = 0$. However, since these solutions are related to dispersion surfaces
which are well separated in phase space from the mode conversion we are studying,
we can treat the new solutions separately. Effectively, we can ignore these solutions
in the limit $\epsilon \to 0$, and only consider the solutions related to the mode conversion.

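Neglecting the second order derivatives, the upper channel of equation (4.27)
becomes

$$(1 - \epsilon b_1 q)\,\bigl(i\partial_q\psi_1(q)\bigr) + \epsilon\left(-\frac{ib_1}{2} + c_1q^2\right)\psi_1(q) = 0. \tag{4.33}$$

This is a logarithmic derivative of $\psi_1(q)$, which can be integrated to give us, up to
an overall amplitude from the constant of integration, the solution to order $\epsilon$:

$$\psi_1(q) = \exp\left(\frac{i\epsilon c_1q^3}{3} + \frac{\epsilon b_1 q}{2}\right) = e^{i\epsilon c_1q^3/3}\left(1 + \frac{\epsilon b_1 q}{2} + O(\epsilon^2)\right). \tag{4.34}$$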



These two terms have a very nice correspondence to standard WKB theory, which
can be seen when we relate them back to the uncoupled dispersion function $D_{11}(q,p)$.
The WKB phase is given by the integral of the "momentum" $P(q)$, which is the function
which solves $D_{11}(q, P(q)) = 0$. Solving this to order $\epsilon$ gives $P(q) = D_{11}(q,0) =
\epsilon c_1q^2$, which is the first term from the Taylor's series expansion of $D_{11}(q,p)$ about
$p = 0$. This integrates up to give exactly the phase which appears in equation (4.34).

As in Equation (2.41), the WKB amplitude comes from the next term in the
expansion of $D_{11}(q,p)$ about $p = 0$,

$$\left|\frac{\partial D_{11}(q,p)}{\partial p}\right|^{-1/2}_{p=0} = \left|-1 + \epsilon b_1 q\right|^{-1/2}. \tag{4.35}$$

If we expand this amplitude as a series in $\epsilon$ it gives the same order $\epsilon$ correction to
the amplitude which appears in equation (4.34).

An analogous calculation can be performed for the second channel in the p-representation.
Starting with equation (4.30) and setting $\eta$ to zero gives us a similar
solution, except for the sign of the $b$ coefficient. Our uncoupled WKB modes are
then

$$\psi_1(q) = \frac{e^{i\epsilon c_1q^3/3}}{(1 - \epsilon b_1 q)^{1/2}}, \qquad \psi_2(p) = \frac{e^{i\epsilon c_2p^3/3}}{(1 + \epsilon b_2 p)^{1/2}}. \tag{4.36}$$

These solutions will be used to match the incoming and outgoing WKB waves onto
our local solution.

It remains to check that we do not introduce errors at order $\epsilon$ by neglecting the
second order derivatives. This check is straightforward since equation (4.33) can be
used to write the derivative of $\psi_1(q)$ as

d,Mq) = ^fmM- (4.37) 

Since this derivative will be multiplied by $\epsilon a_1$ in the second order term, we have that
the effect of this term is at least order $\epsilon^2$ when acting on our solution in (4.36). The
same analysis holds for the second order derivative acting on the lower channel.

4.2.5 Coupled WKB Modes 

The uncoupled modes are generated by the eigenvalues of the dispersion matrix
when we set the coupling parameter $\eta$ to zero. If the dispersion matrix is in normal
form, then the diagonals of the matrix can be interpreted as the generators of the
uncoupled modes. If, on the other hand, we are interested in the coupled modes,
then we can use the eigenvalues of the dispersion matrix to generate the modes. This
is equivalent to using the determinant as the generator, since the diagonalized matrix
has the eigenvalues on the diagonals. So, in order to see the effect of the coupling
on the WKB modes, we need to calculate the determinant of the dispersion matrix
with $\eta \neq 0$. First, we recall how this works in the case of the linear approximation
to the dispersion matrix, and then extend the calculation to include the quadratic
order terms. We will need these expressions for the coupled WKB modes when we
want to match incoming and outgoing rays across the mode conversion region.
FIG. 4.2: Dispersion curves for the uncoupled modes, showing the effect of the higher
order terms ($\epsilon = 1/5$). The red curves are solutions to $D_{11} = 0$, and the blue curves
are solutions to $D_{22} = 0$. The original crossing is at $q = p = 0$. The parameter
values used for these plots are: (a) $a_1 = 1$: This term introduces a new branch to the
dispersion surface $D_{11} = 0$ at $p \approx 5$. As $\epsilon \to 0$, this branch moves off to infinity.
(b) $c_1 = 1$: This term introduces a curvature into the $D_{11}$ dispersion surface. In the
$p$ representation, an uncoupled solution in the upper channel would now look like an
Airy function. (c) $b_1 = 1$: Although not obvious from the dispersion curves, this term
introduces an amplitude variation which has a square-root form, and which matches the
WKB amplitude variation due to action conservation. (d) $b_1 = 1$, $c_1 = 0.1$: In general,
there will be several higher order terms, all of which affect the dispersion surfaces.
The determinant of the linear part of the dispersion matrix is

$$\det\mathbf{D}(q,p) = -pq - |\eta|^2. \tag{4.38}$$

With $\eta = 0$, solving $\det\mathbf{D}(q,p) = 0$ gives the two uncoupled modes $p = 0$ and
$q = 0$. When we bring the coupling back in, solving for $p$ as a function of $q$
gives us

$$p(q) = -\frac{|\eta|^2}{q}. \tag{4.39}$$

This function goes into the phase of the WKB solution

$$\psi_1(q) = A_1(q)\exp\left(i\int^q p(q')\,dq'\right) = A_1(q)\exp\left(-i|\eta|^2\ln(q)\right). \tag{4.40}$$

This logarithmic phase is what allows us to match the WKB solutions onto the local
solution of the linearized equations.

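When we include the second order terms in the dispersion matrix, we get

$$\det\mathbf{D}(q,p) = \bigl(-p + \epsilon\mathcal{D}_1(q,p)\bigr)\bigl(q + \epsilon\mathcal{D}_2(q,p)\bigr) - |\eta|^2 = 0. \tag{4.41}$$

Here, $\mathcal{D}_1(q,p)$ and $\mathcal{D}_2(q,p)$ are the second order terms from the dispersion matrix

$$\mathcal{D}_1(q,p) = a_1p^2 + b_1qp + c_1q^2, \qquad \mathcal{D}_2(q,p) = a_2q^2 + b_2qp + c_2p^2. \tag{4.42}$$

We now want to solve this equation for $p(q)$. Expand $p$ in powers of $\epsilon$,

$$p(q) = p_0(q) + \epsilon p_1(q) + \ldots = \frac{-|\eta|^2}{q} + \epsilon p_1(q) + \ldots, \tag{4.43}$$

and insert this into equation (4.41). Keeping terms of order $\epsilon$, and solving for $p_1(q)$,
we get

\begin{align}
p_1(q) &= \frac{1}{q}\bigl(-p_0\,\mathcal{D}_2(q,p_0) + q\,\mathcal{D}_1(q,p_0)\bigr) \tag{4.44}\\
&= \frac{|\eta|^2}{q^2}\,\mathcal{D}_2(q,p_0) + \mathcal{D}_1(q,p_0) \tag{4.45}\\
&= c_1q^2 + |\eta|^2(a_2 - b_1) + O(|\eta|^4). \tag{4.46}
\end{align}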
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The term Ciq^ comes from the curvature of the uncoupled dispersion surface for the 
upper channeL The additional two terms are effects of the coupling. They enter the 
WKB solution at order e|?7p: 



■01 (g) = Ai{q) exp j z j po{q') + epi(g') dq 



(4.47) 

Ai(g) exp |-z|r/|' ln(g) + ze + \r]\'\a2 - bi)q^ | . (4.48) 
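The first-order root in equations (4.43) and (4.44) can be checked numerically. The following is a minimal sketch, not part of the original calculation: the function names are hypothetical, the coefficient values are arbitrary test inputs, and `eta2` stands for $|\eta|^2$. It solves $\det D = 0$ by Newton iteration starting from $p_0$ and compares the result with $p_0 + \epsilon p_1$.

```python
def second_order_terms(q, p, coeffs):
    """Return (D1, D2), the quadratic terms of equation (4.42)."""
    a1, b1, c1, a2, b2, c2 = coeffs
    d1 = a1 * p * p + b1 * q * p + c1 * q * q
    d2 = a2 * q * q + b2 * q * p + c2 * p * p
    return d1, d2


def det_d(q, p, eps, eta2, coeffs):
    """Left-hand side of equation (4.41); eta2 stands for |eta|^2."""
    d1, d2 = second_order_terms(q, p, coeffs)
    return (-p + eps * d1) * (q + eps * d2) - eta2


def p_first_order(q, eps, eta2, coeffs):
    """p0 + eps * p1, with p1 taken from equation (4.44)."""
    p0 = -eta2 / q
    d1, d2 = second_order_terms(q, p0, coeffs)
    p1 = (-p0 * d2 + q * d1) / q
    return p0 + eps * p1


def p_newton(q, eps, eta2, coeffs, steps=50, h=1e-7):
    """Newton iteration on det D = 0, started from the linear root p0."""
    p = -eta2 / q
    for _ in range(steps):
        f = det_d(q, p, eps, eta2, coeffs)
        df = (det_d(q, p + h, eps, eta2, coeffs)
              - det_d(q, p - h, eps, eta2, coeffs)) / (2 * h)
        p -= f / df
    return p


# Illustrative values only: (a1, b1, c1, a2, b2, c2)
coeffs = (0.3, 0.7, 0.1, 0.3, 0.7, -0.5)
q, eps, eta2 = 2.0, 1e-3, 0.25
exact = p_newton(q, eps, eta2, coeffs)
approx = p_first_order(q, eps, eta2, coeffs)
print(abs(exact - approx))  # should be of order eps**2
```

Under these assumptions the difference between the two roots should shrink roughly as $\epsilon^2$, while the difference from $p_0$ alone shrinks only as $\epsilon$.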

Here, the amplitude is computed from the derivative of the eigenvalue which
asymptotically approaches $D_{11}$, and which is the generator for ray evolution in the
upper channel:

$$A_1(q) = \left|\frac{\partial D_\alpha(q,p)}{\partial p}\right|^{-1/2}_{p=p(q)} = \left|-1 + \epsilon b_1q + \ldots\right|^{-1/2}. \tag{4.49}$$



The calculation for the coupled WKB mode in the lower channel proceeds along
similar lines, and uses the $p$ representation of the equations. The result is

$$\psi_2(p) = A_2(p)\exp\left(-i|\eta|^2\ln(p) + i\epsilon\left[\frac{c_2p^3}{3} + |\eta|^2(a_1 - b_2)p\right]\right), \tag{4.50}$$

with

$$A_2(p) = \left|1 + \epsilon b_2p + \ldots\right|^{-1/2}. \tag{4.51}$$



We will use these expressions for the coupled WKB solutions when matching 
the local solution to the propagating modes. 

4.2.6 Local Coupled Solutions 

Now that we know the form of the incoming and outgoing WKB waves, we
need to solve the system of equations locally in order to find the scattering
coefficients which connect the incoming to outgoing waves. We will find the higher order
corrections by expanding the local fields in $\epsilon$, and then using the $O(\epsilon)$ equations to
get the local solution.

$$\psi_1(q) = q^{-i|\eta|^2}\left(1 + \epsilon\Theta_1(q) + O(\epsilon^2)\right) \tag{4.52}$$

$$\psi_2(q) = -\eta^* q^{-i|\eta|^2-1}\left(1 + \epsilon\Theta_2(q) + O(\epsilon^2)\right) \tag{4.53}$$

Because of the form of the equations, it is convenient to expand $\Theta_1$ and $\Theta_2$ as power
series in $q$.

$$\Theta_1(q) = \sum_{n=-\infty}^{\infty} s_nq^n, \qquad \Theta_2(q) = \sum_{n=-\infty}^{\infty} \tilde{s}_nq^n \tag{4.54}$$

In order to compact the notation, and simplify the calculations, define $\beta_n$ as
the coefficient obtained when Fourier transforming an arbitrary term in the series:

$$\int dq\, e^{-ipq}\, q^{-i|\eta|^2+n} = \beta_n\, p^{i|\eta|^2-n-1}. \tag{4.55}$$

This definition is possible since it can be shown that

$$\int dq\, e^{-ipq}\, q^{\alpha} \propto p^{-\alpha-1}. \tag{4.56}$$

The inverse transformation will also involve $\beta_n$:

$$\int dp\, e^{ipq}\, p^{i|\eta|^2-n} = \frac{2\pi}{\beta_{n-1}}\, q^{-i|\eta|^2+n-1}. \tag{4.57}$$

Notice how positive and negative powers of $q$ and $p$ exchange roles. This is most
likely due to properties of the dilation group, since these functions are related to
representations of that group.

Evaluating the Fourier integral by using the Hankel formula for the gamma
function, we can find $\beta_0$, as in [6]. By analytic continuation of the complex gamma
function, we find, for any integer $n$,

f^n ^ I, ^ (4.58) 

i («|?7| — n) 



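We will also need ratios of $\beta$'s, which can be found using the properties of the gamma
function.

$$\frac{\beta_n}{\beta_{n-1}} = i\left(i|\eta|^2 - n\right) \tag{4.59}$$

Our equations involve the operators $\hat{p}^2$, $\hat{q}^2$, and $\hat{p}\hat{q}$.¹ Using the definitions above,
we can evaluate the action of these operators for an arbitrary term in our series.

$$\hat{p}\, q^{-i|\eta|^2+n} = \hat{p}\, \frac{\beta_n}{2\pi}\int dp\, e^{ipq}\, p^{i|\eta|^2-n-1} \tag{4.60}$$

$$= \frac{\beta_n}{2\pi}\int dp\, e^{ipq}\, p^{i|\eta|^2-n} \tag{4.61}$$

$$= \frac{\beta_n}{\beta_{n-1}}\, q^{-i|\eta|^2+n-1} \tag{4.62}$$

Applying this formula twice gives us

$$\hat{p}^2\, q^{-i|\eta|^2+n} = \frac{\beta_n}{\beta_{n-2}}\, q^{-i|\eta|^2+n-2}, \tag{4.63}$$

$$\hat{p}\hat{q}\, q^{-i|\eta|^2+n} = \left(-\frac{i}{2} + \frac{\beta_n}{\beta_{n-1}}\right) q^{-i|\eta|^2+n}. \tag{4.64}$$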



Similar calculations yield 

^-i|r?|2+n. ^ ( _ 

and 

$$\hat{q}^2\, q^{-i|\eta|^2+n} = q^{-i|\eta|^2+n+2} \tag{4.65}$$
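The operator identities above can be spot-checked numerically. The sketch below is an illustration under stated assumptions: it takes $\hat{p} = -i\,d/dq$ (the convention implied by the $e^{-ipq}$ Fourier kernel), reads the Weyl-ordered $\hat{p}\hat{q}$ as $(\hat{p}\hat{q} + \hat{q}\hat{p})/2$, restricts to real $q > 0$, and uses hypothetical function names. Derivatives are approximated by central differences.

```python
def power(q, n, eta2):
    """q**(-i*eta2 + n) for real q > 0; eta2 stands for |eta|^2."""
    return q ** complex(n, -eta2)


def beta_ratio(n, eta2):
    """beta_n / beta_{n-1} as given by equation (4.59)."""
    return 1j * (1j * eta2 - n)


def apply_p(f, q, h=1e-5):
    """p-hat f = -i df/dq, approximated by a central difference."""
    return -1j * (f(q + h) - f(q - h)) / (2 * h)


def apply_weyl_pq(f, q, h=1e-5):
    """Weyl-ordered (pq + qp)/2 acting on f, i.e. q * (p-hat f) - (i/2) f."""
    return q * apply_p(f, q, h) - 0.5j * f(q)


n, eta2, q = 2, 0.3, 1.7          # arbitrary test values
f = lambda x: power(x, n, eta2)

lhs_p = apply_p(f, q)
rhs_p = beta_ratio(n, eta2) * power(q, n - 1, eta2)

lhs_w = apply_weyl_pq(f, q)
rhs_w = (-0.5j + beta_ratio(n, eta2)) * f(q)

print(abs(lhs_p - rhs_p), abs(lhs_w - rhs_w))  # both should be near zero
```

Both residuals should be limited only by the finite-difference error, which supports reading the ratio $\beta_n/\beta_{n-1}$ as the factor brought down by $\hat{p}$.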

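We can now write out our system of equations in (4.27) using the coefficients
and our series expansions for the fields. Keeping only terms of order $\epsilon$, we get

$$-\hat{p}\sum_n s_n q^{-i|\eta|^2+n} + \mathcal{D}_1(\hat{q},\hat{p})\, q^{-i|\eta|^2} - |\eta|^2\sum_n \tilde{s}_n q^{-i|\eta|^2-1+n} = 0 \tag{4.66}$$

and

$$\sum_n s_n q^{-i|\eta|^2+n} - \mathcal{D}_2(\hat{q},\hat{p})\, q^{-i|\eta|^2-1} - \hat{q}\sum_n \tilde{s}_n q^{-i|\eta|^2-1+n} = 0. \tag{4.67}$$

Now evaluate the operators using the expressions above.

$$-\sum_n s_n\frac{\beta_n}{\beta_{n-1}} q^{-i|\eta|^2+n-1} + a_1\frac{\beta_0}{\beta_{-2}} q^{-i|\eta|^2-2} + b_1\left(-\frac{i}{2} + \frac{\beta_0}{\beta_{-1}}\right) q^{-i|\eta|^2} + c_1 q^{-i|\eta|^2+2} - |\eta|^2\sum_n \tilde{s}_n q^{-i|\eta|^2-1+n} = 0 \tag{4.68}$$

$$\sum_n s_n q^{-i|\eta|^2+n} - \sum_n \tilde{s}_n q^{-i|\eta|^2+n} - c_2\frac{\beta_{-1}}{\beta_{-3}} q^{-i|\eta|^2-3} - b_2\left(-\frac{i}{2} + \frac{\beta_{-1}}{\beta_{-2}}\right) q^{-i|\eta|^2-1} - a_2 q^{-i|\eta|^2+1} = 0 \tag{4.69}$$

¹By $\hat{p}\hat{q}$ we mean the operator whose Weyl symbol is $pq$. Since $p$ and $q$ commute as variables in
phase space, we could also have written this operator as $\hat{q}\hat{p}$.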

These equations can now be solved for the coefficients $s_n$ and $\tilde{s}_n$. For $n \notin \{-3, -1, 1, 3\}$
these can be solved to find $s_n = \tilde{s}_n = 0$. Otherwise, we have a set of linear equations
which can be solved to give, after generous application of equation (4.59) to simplify
the coefficients,

$$s_3 = \frac{ic_1}{3} \tag{4.70}$$

$$\tilde{s}_3 = \frac{ic_1}{3} \tag{4.71}$$

$$s_1 = i(b_1 - a_2)\frac{\beta_0}{\beta_{-1}} + \frac{b_1}{2} \tag{4.72}$$

$$\tilde{s}_1 = i(b_1 - a_2)\frac{\beta_1}{\beta_0} - \frac{b_1}{2} \tag{4.73}$$

$$s_{-1} = i(b_2 - a_1)\frac{\beta_0}{\beta_{-2}} + \frac{b_2\beta_0}{2\beta_{-1}} \tag{4.74}$$

$$\tilde{s}_{-1} = i(b_2 - a_1)\frac{\beta_0}{\beta_{-2}} - \frac{b_2\beta_{-1}}{2\beta_{-2}} \tag{4.75}$$

$$s_{-3} = \frac{ic_2}{3}\frac{\beta_0}{\beta_{-3}} \tag{4.76}$$

$$\tilde{s}_{-3} = \frac{ic_2}{3}\frac{\beta_{-1}}{\beta_{-4}} \tag{4.77}$$
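The closed-form coefficients can be cross-checked by solving the order-$\epsilon$ equations power by power. The sketch below assumes that, for each $n$, the two equations reduce to $-(\beta_n/\beta_{n-1})s_n - |\eta|^2\tilde{s}_n + A_n = 0$ and $s_n - \tilde{s}_n - B_n = 0$, where $A_n$ and $B_n$ collect the terms generated by $\mathcal{D}_1$ and $\mathcal{D}_2$. This reduction, the function names, and the numerical values are assumptions made for illustration.

```python
def r(n, eta2):
    """beta_n / beta_{n-1} = i (i|eta|^2 - n), equation (4.59)."""
    return 1j * (1j * eta2 - n)


def beta_ratio(j, k, eta2):
    """beta_j / beta_k for j >= k, built from adjacent ratios."""
    out = 1 + 0j
    for m in range(k + 1, j + 1):
        out *= r(m, eta2)
    return out


def sources(n, eta2, coeffs):
    """Inhomogeneous terms (A_n, B_n) from D1 and D2 (assumed form)."""
    a1, b1, c1, a2, b2, c2 = coeffs
    src_a = {-1: a1 * beta_ratio(0, -2, eta2),
             1: b1 * (-0.5j + r(0, eta2)),
             3: c1}
    src_b = {1: a2,
             -1: b2 * (-0.5j + r(-1, eta2)),
             -3: c2 * beta_ratio(-1, -3, eta2)}
    return src_a.get(n, 0), src_b.get(n, 0)


def solve_pair(n, eta2, coeffs):
    """Solve the 2x2 system for (s_n, s~_n) by Cramer's rule (n != 0)."""
    src_a, src_b = sources(n, eta2, coeffs)
    m11, m12, m21, m22 = -r(n, eta2), -eta2, 1, -1
    det = m11 * m22 - m12 * m21
    s = (-src_a * m22 - m12 * src_b) / det
    s_tilde = (m11 * src_b + m21 * src_a) / det
    return s, s_tilde


def closed_forms(eta2, coeffs):
    """Coefficients as written in equations (4.70)-(4.77)."""
    a1, b1, c1, a2, b2, c2 = coeffs
    br = lambda j, k: beta_ratio(j, k, eta2)
    return {
        3: (1j * c1 / 3, 1j * c1 / 3),
        1: (1j * (b1 - a2) * br(0, -1) + b1 / 2,
            1j * (b1 - a2) * br(1, 0) - b1 / 2),
        -1: (1j * (b2 - a1) * br(0, -2) + b2 * br(0, -1) / 2,
             1j * (b2 - a1) * br(0, -2) - b2 * br(-1, -2) / 2),
        -3: (1j * c2 * br(0, -3) / 3, 1j * c2 * br(-1, -4) / 3),
    }


coeffs = (0.3, 0.7, 0.1, 0.2, 0.5, -0.4)   # arbitrary (a1, b1, c1, a2, b2, c2)
eta2 = 0.35
expected = closed_forms(eta2, coeffs)
errors = {n: max(abs(x - y) for x, y in zip(solve_pair(n, eta2, coeffs), expected[n]))
          for n in (-3, -1, 1, 3)}
print(errors)
```

For any other nonzero $n$ the source terms vanish and the same solver returns $s_n = \tilde{s}_n = 0$.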



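Applying the Fourier transform in equation (4.55), we can find the local
solutions in the $p$ representation.

$$\psi_1(p) = \int dq\, e^{-ipq}\psi_1(q) \tag{4.78}$$

$$= \beta_0\, p^{i|\eta|^2-1}\left(1 + \epsilon\sum_n \sigma_n p^n + O(\epsilon^2)\right) \tag{4.79}$$

$$\psi_2(p) = -\eta^*\beta_{-1}\, p^{i|\eta|^2}\left(1 + \epsilon\sum_n \tilde{\sigma}_n p^n + O(\epsilon^2)\right) \tag{4.80}$$

The terms in the series are

$$\sigma_n = s_{-n}\frac{\beta_{-n}}{\beta_0}, \qquad \tilde{\sigma}_n = \tilde{s}_{-n}\frac{\beta_{-n-1}}{\beta_{-1}}, \tag{4.81}$$

which means that the coefficients can be calculated explicitly to give

$$\sigma_{-3} = \frac{ic_1}{3}\frac{\beta_3}{\beta_0} \tag{4.82}$$

$$\tilde{\sigma}_{-3} = \frac{ic_1}{3}\frac{\beta_2}{\beta_{-1}} \tag{4.83}$$

$$\sigma_{-1} = i(b_1 - a_2)\frac{\beta_1}{\beta_{-1}} + \frac{b_1\beta_1}{2\beta_0} \tag{4.84}$$

$$\tilde{\sigma}_{-1} = i(b_1 - a_2)\frac{\beta_1}{\beta_{-1}} - \frac{b_1\beta_0}{2\beta_{-1}} \tag{4.85}$$

$$\sigma_1 = i(b_2 - a_1)\frac{\beta_{-1}}{\beta_{-2}} + \frac{b_2}{2} \tag{4.86}$$

$$\tilde{\sigma}_1 = i(b_2 - a_1)\frac{\beta_0}{\beta_{-1}} - \frac{b_2}{2} \tag{4.87}$$

$$\sigma_3 = \frac{ic_2}{3} \tag{4.88}$$

$$\tilde{\sigma}_3 = \frac{ic_2}{3}. \tag{4.89}$$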



4.2.7 Matching the Upper Channel 

In order to find the connection between the incoming and outgoing wave fields,
we need to match the WKB solutions in Section 4.2.5 with the local solutions in
Section 4.2.6. This matching will allow us to find the scattering matrix for this mode
conversion, which determines the amplitudes and phases of the outgoing waves given
the amplitudes and phases of the incoming waves. The various WKB waves can be
identified with the dispersion surfaces from which their phases are calculated. This
lets us label the branches of the dispersion surface by the type of WKB mode which
it generates, as in Figure 4.3.

Matching a WKB wave incoming in the upper channel requires comparison of
the uncoupled WKB solution and the local solution in the upper channel. These
are given by

$$\psi_1^{(\mathrm{WKB})}(q) = \frac{e^{i\epsilon c_1q^3/3}}{\sqrt{1 - \epsilon b_1q}} = 1 + \frac{i\epsilon c_1q^3}{3} + \frac{\epsilon b_1q}{2} + O(\epsilon^2) \tag{4.91}$$

and

$$\psi_1^{(\mathrm{Local})}(q) = q^{-i|\eta|^2}\exp\left(i\epsilon|\eta|^2(a_2 - b_1)q + \frac{i\epsilon c_1q^3}{3} + \frac{\epsilon b_1q}{2}\right). \tag{4.92}$$

FIG. 4.3: Dispersion surfaces for the uncoupled WKB modes, defined by $D_{11}(q,p) = 0$
and $D_{22}(q,p) = 0$, cross at the mode conversion point. The coupled dispersion surface,
defined by solving $\det(D(q,p)) = 0$, has a hyperbolic structure in the mode conversion
region. Its branches asymptote to the uncoupled dispersion surfaces.

We have left out the terms with negative powers of $q$ from the phase of the local
solution, since the effect of these terms is localized to the mode conversion region.

A matching point $q_* < 0$ is now chosen where both of these expressions are
valid. The amplitude and phase of the incoming WKB mode at this point are used
to set the incoming amplitude and phase of the local solution.



$$\psi_1^{<}(q) = A\, q_*^{i|\eta|^2}\exp\left(i|\eta|^2\epsilon b_1q_* - i|\eta|^2\epsilon a_2q_*\right)\psi_1^{(\mathrm{Local})}(q), \tag{4.93}$$

where $A$ is the complex amplitude of the incoming wave. We can put the term $q_*^{i|\eta|^2}$
into the phase as a logarithm, if we first take the absolute value.

$$q_*^{i|\eta|^2} = (-1)^{i|\eta|^2}\exp\left(i|\eta|^2\log(|q_*|)\right) = e^{-\pi|\eta|^2}\exp\left(i|\eta|^2\log(|q_*|)\right) \tag{4.94}$$

Here we have used $-1 = e^{\pm i\pi}$. The $+$ sign is taken in this exponent since we know
that the amplitude of the upper channel decreases across the mode conversion when
it loses energy to the lower channel. A more careful derivation of this sign can be
found in Reference [6]. (We are also assuming that both waves are positive energy
waves. If one of the modes were a negative energy wave, then there is the possibility
that the amplitudes of both increase across the mode conversion.) This gives us an
expression for the local field, including the amplitude and phase of the incoming
wave.

$$\psi_1^{<}(q) = Ae^{-\pi|\eta|^2}\exp\left(i|\eta|^2\log(|q_*|) + i|\eta|^2\epsilon(b_1 - a_2)q_*\right)\psi_1^{(\mathrm{Local})}(q) \tag{4.95}$$
Now choose a matching point $q_{**} > 0$ where this local solution can be matched
to the outgoing WKB wave. We then can write the outgoing field as

$$\psi_1^{>}(q) = \psi_1^{(\mathrm{WKB})}(q)\,\frac{\psi_1^{<}(q_{**})}{\psi_1^{(\mathrm{WKB})}(q_{**})} \tag{4.96}$$

$$= Ae^{-\pi|\eta|^2}\exp\left(i|\eta|^2\log(|q_*|) + i|\eta|^2\epsilon(b_1 - a_2)q_*\right) \tag{4.97}$$

$$\times \exp\left(-i|\eta|^2\log(|q_{**}|) - i|\eta|^2\epsilon(b_1 - a_2)q_{**}\right)\psi_1^{(\mathrm{WKB})}(q). \tag{4.98}$$

This expression shows that, in addition to the amplitude jump found in the linear
analysis, there is a phase shift of order $\epsilon|\eta|^2$ which is due to the presence of the
quadratic terms. This expression is valid for any matching points $q_*$ and $q_{**}$ in the
appropriate matching regions. However, it simplifies if we choose matching points
symmetrically about the mode conversion point. In this case, we get

$$\psi_1^{>}(q) = Ae^{-\pi|\eta|^2}\exp\left(2i|\eta|^2\epsilon(a_2 - b_1)q_M\right)\frac{e^{i\epsilon c_1q^3/3}}{\sqrt{1 - \epsilon b_1q}}, \tag{4.99}$$

where we have defined $q_M = -q_* = q_{**}$. This expression includes both the linear
order effects calculated previously (e.g., in [6]), as well as the new effects of the
quadratic order terms. These effects are seen in the expression for the coupled
WKB mode given in equation (4.47).
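The matching factor for the upper channel is simple enough to evaluate directly. The sketch below is illustrative only: the function name and parameter values are hypothetical, and `eta2` stands for $|\eta|^2$. It computes the constant that multiplies $A\,\psi_1^{(\mathrm{WKB})}(q)$ for general matching points, and shows that for symmetric points the logarithms cancel, leaving the amplitude factor $e^{-\pi|\eta|^2}$ and a phase shift $2\epsilon|\eta|^2(a_2 - b_1)q_M$.

```python
import cmath
import math


def upper_channel_factor(eta2, eps, a2, b1, q_in, q_out):
    """Constant multiplying A * psi_WKB(q) in the outgoing upper-channel field.

    q_in < 0 and q_out > 0 are the matching points on either side of the
    mode conversion point; eta2 stands for |eta|^2.
    """
    phase_in = eta2 * math.log(abs(q_in)) + eta2 * eps * (b1 - a2) * q_in
    phase_out = -eta2 * math.log(abs(q_out)) - eta2 * eps * (b1 - a2) * q_out
    return math.exp(-math.pi * eta2) * cmath.exp(1j * (phase_in + phase_out))


# Illustrative values; symmetric matching points q_M = 5
eta2, eps, a2, b1, q_m = 0.2, 1e-2, 0.3, 0.7, 5.0
factor = upper_channel_factor(eta2, eps, a2, b1, q_in=-q_m, q_out=q_m)
print(abs(factor), cmath.phase(factor))
```

The modulus depends only on $|\eta|^2$, which is the amplitude jump of the linear theory; the quadratic terms enter only through the phase.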

4.2.8 Matching the Lower Channel 

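Use the $p$ representation to match the lower channel. The equations to match
are

$$\psi_2^{(\mathrm{WKB})}(p) = \frac{e^{i\epsilon c_2p^3/3}}{\sqrt{1 + \epsilon b_2p}} = 1 + \frac{i\epsilon c_2p^3}{3} - \frac{\epsilon b_2p}{2} + O(\epsilon^2) \tag{4.100}$$

and

$$\psi_2^{(\mathrm{Local})}(p) = -\eta^*\beta_{-1}\, p^{i|\eta|^2}\exp\left(i\epsilon|\eta|^2(a_1 - b_2)p - \frac{\epsilon b_2p}{2} + \frac{i\epsilon c_2p^3}{3}\right). \tag{4.101}$$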



The matching proceeds as in the first channel, except that the initial amplitude and
phase are given by the incoming data in the upper channel. Since the pair given in
equations (4.52) and (4.53) have the correct relative amplitude and phase, we can
apply the amplitude and phase corrections from (4.95) to the $q$ representation for
the lower channel. After Fourier transforming into the $p$ representation, this gives
us the local field for the lower channel.

$$\psi_2^{<}(p) = A\,\alpha_1(q_*)\,\psi_2^{(\mathrm{Local})}(p) \tag{4.102}$$

The corrections from the first channel are defined as

$$\alpha_1(q_*) = e^{-\pi|\eta|^2}\exp\left(i|\eta|^2\ln|q_*| + i|\eta|^2\epsilon(b_1 - a_2)q_*\right). \tag{4.103}$$

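We now pick a point $p_* > 0$ where we match the local solution onto the outgoing
WKB wave. The outgoing wave is then

$$\psi_2^{>}(p) = A\,\alpha_1(q_*)\,\frac{\psi_2^{(\mathrm{Local})}(p_*)}{\psi_2^{(\mathrm{WKB})}(p_*)}\,\psi_2^{(\mathrm{WKB})}(p) \tag{4.104}$$

$$= -A\eta^*\beta_{-1}\,\alpha_1(q_*)\,\alpha_2(p_*)\,\frac{e^{i\epsilon c_2p^3/3}}{\sqrt{1 + \epsilon b_2p}}, \tag{4.105}$$

where

$$\alpha_2(p_*) = \exp\left(i|\eta|^2\ln|p_*| - i|\eta|^2\epsilon(b_2 - a_1)p_*\right). \tag{4.106}$$

Putting these together means that the outgoing converted wave has the form

$$\psi_2^{>}(p) = -A\eta^*\beta_{-1}\,e^{-\pi|\eta|^2}\,|q_*|^{i|\eta|^2}\,|p_*|^{i|\eta|^2}\exp\left(i\epsilon|\eta|^2\left((b_1 - a_2)q_* - (b_2 - a_1)p_*\right)\right)\frac{e^{i\epsilon c_2p^3/3}}{\sqrt{1 + \epsilon b_2p}}. \tag{4.107}$$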

4.3 Comparison with Numerical Solution 

In order to verify the corrected solutions derived above, we compared both
the corrected local fields and the matched WKB waves to the results of numerical
simulations. In order to avoid the singularities in the solutions, the numerical
calculation was carried out in the $t$ representation, where the phase space coordinates
$(t, \omega)$ are those defined by the linear canonical transformation in Equation (3.71).
The associated metaplectic transformations of the WKB waves in equation (4.36)
are

$$\psi_1^{(\mathrm{WKB})}(t) = \int_{-\infty}^{\infty} e^{iF_1(t,q)}\,\frac{e^{i\epsilon c_1q^3/3}}{\sqrt{1 - \epsilon b_1q}}\,dq \tag{4.108}$$

and

$$\psi_2^{(\mathrm{WKB})}(t) = \int_{-\infty}^{\infty} e^{iF_2(t,p)}\,\frac{e^{i\epsilon c_2p^3/3}}{\sqrt{1 + \epsilon b_2p}}\,dp, \tag{4.109}$$

where $F_1(t,q)$ and $F_2(t,p)$ are the generating functions for the linear canonical
transformations:

$$F_1(t,q) = \frac{1}{2}\left(t^2 - 2\sqrt{2}\,tq + q^2\right) \tag{4.110}$$

$$F_2(t,p) = -\frac{1}{2}\left(t^2 - 2\sqrt{2}\,tp + p^2\right). \tag{4.111}$$
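The relation between these generating functions and the canonical transformation can be checked numerically. The sketch below assumes the conventions $\omega = \partial F_1/\partial t$, $p = -\partial F_1/\partial q$, $\omega = \partial F_2/\partial t$ and $q = \partial F_2/\partial p$, and that the transformation is $q = (t - \omega)/\sqrt{2}$, $p = (t + \omega)/\sqrt{2}$. The helper names and sample point are hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)


def f1(t, q):
    """Generating function F1(t, q) of equation (4.110)."""
    return 0.5 * (t * t - 2 * SQRT2 * t * q + q * q)


def f2(t, p):
    """Generating function F2(t, p) of equation (4.111)."""
    return -0.5 * (t * t - 2 * SQRT2 * t * p + p * p)


def d_first(f, t, x, h=1e-4):
    """Central-difference derivative with respect to the first argument."""
    return (f(t + h, x) - f(t - h, x)) / (2 * h)


def d_second(f, t, x, h=1e-4):
    """Central-difference derivative with respect to the second argument."""
    return (f(t, x + h) - f(t, x - h)) / (2 * h)


t, q = 0.8, -1.3                      # arbitrary sample point
omega = d_first(f1, t, q)             # omega = dF1/dt   (assumed sign convention)
p = -d_second(f1, t, q)               # p = -dF1/dq      (assumed sign convention)

q_from_map = (t - omega) / SQRT2      # should reproduce q
p_from_map = (t + omega) / SQRT2      # should reproduce p
omega_from_f2 = d_first(f2, t, p)     # omega = dF2/dt
q_from_f2 = d_second(f2, t, p)        # q = dF2/dp
print(q_from_map, p_from_map, omega_from_f2, q_from_f2)
```

Because both generating functions are quadratic, the central differences are exact up to rounding, so the comparison isolates the sign and normalization conventions.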

The conjugate variables are given by derivatives of the generating functions:

$$\omega = \frac{\partial F_1}{\partial t}, \qquad p = -\frac{\partial F_1}{\partial q}, \tag{4.112}$$

$$\omega = \frac{\partial F_2}{\partial t}, \qquad q = \frac{\partial F_2}{\partial p}. \tag{4.113}$$

In order to evaluate the metaplectic integrals in Equations (4.108) and (4.109),
we write $q$ in terms of $t$ and possibly also derivatives with respect to $t$, i.e. we find
the $t$ representation of the operator $\hat{q}$. The properties of the generating functions
allow us to do this. First, use Equations (3.71) and (4.112) to write

$$q = \frac{1}{\sqrt{2}}(t - \omega) = \frac{1}{\sqrt{2}}\left(t - \partial_tF_1(t,q)\right). \tag{4.114}$$

Now, notice that we can combine the derivative $\partial_tF_1(t,q)$ with the phase in the
integral to write

$$\partial_tF_1(t,q)\,e^{iF_1(t,q)} = -i\partial_t e^{iF_1(t,q)}. \tag{4.115}$$

We can now consider our corrections as a pseudodifferential operator acting on our
function, by making the substitution

$$q \to \frac{1}{\sqrt{2}}(t + i\partial_t), \tag{4.116}$$

wherever $q$ appears in our corrections. In order to do this, write the corrections as
a Taylor series:

$$S_1(q) = \frac{e^{i\epsilon c_1q^3/3}}{\sqrt{1 - \epsilon b_1q}} \tag{4.117}$$

$$= \sum_n s_nq^n. \tag{4.118}$$

We associate this with a pseudodifferential operator by replacing $q$ with $(t + i\partial_t)/\sqrt{2}$:

$$S_1(q) \to \sum_n s_n\left(\frac{1}{\sqrt{2}}\right)^n(t + i\partial_t)^n \tag{4.119}$$

$$= S_1\left(\frac{1}{\sqrt{2}}(t + i\partial_t)\right). \tag{4.120}$$

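We can now use this to find the $t$ representation of the field.

$$\psi_1^{(\mathrm{WKB})}(t) = \int_{-\infty}^{\infty} e^{iF_1(t,q)}S_1(q)\,dq \tag{4.121}$$

$$= S_1\left(\frac{1}{\sqrt{2}}(t + i\partial_t)\right)\int_{-\infty}^{\infty} e^{iF_1(t,q)}\,dq \tag{4.122}$$

The integral can be recast as a Gaussian integral, since the phase is quadratic in $q$.

$$\psi_1^{(\mathrm{WKB})}(t) = S_1\left(\frac{1}{\sqrt{2}}(t + i\partial_t)\right)\sqrt{2\pi}\,e^{i\pi/4}e^{-it^2/2} \tag{4.123}$$

$$= \sqrt{2\pi}\,e^{i\pi/4}\left[1 + \frac{\epsilon b_1}{2}\frac{(t + i\partial_t)}{\sqrt{2}} + \frac{i\epsilon c_1}{3}\frac{(t + i\partial_t)^3}{2\sqrt{2}} + O(\epsilon^2)\right]e^{-it^2/2} \tag{4.124}$$

$$= \sqrt{2\pi}\,e^{i\pi/4}\left(1 + \frac{\epsilon b_1}{2}(\sqrt{2}t) + \frac{i\epsilon c_1}{3}\left((\sqrt{2}t)^3 + 3i\sqrt{2}t\right) + O(\epsilon^2)\right)e^{-it^2/2} \tag{4.125}$$

$$\approx \sqrt{2\pi}\,e^{i\pi/4}e^{-it^2/2}\,\frac{e^{i\epsilon c_1(\sqrt{2}t)^3/3}}{\sqrt{1 + \epsilon(-b_1 + 2c_1)\sqrt{2}t}} \tag{4.126}$$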



The appearance of the $c_1$ term in the amplitude is not too surprising. Under the
Fourier transform, the pure phase $\exp(ic_1q^3/3)$ turns into an Airy function, which
has amplitude variations. Converting from the $q$ representation to the $t$ representation
is done with a metaplectic transformation, which is a sort of "partial" Fourier
transform. Therefore, we could expect that the cubic phase in $q$ would give rise to
an amplitude variation in $t$.
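The origin of the extra $c_1$ term can be made explicit by applying $(t + i\partial_t)$ repeatedly to $e^{-it^2/2}$. Acting on $P(t)e^{-it^2/2}$ with a polynomial $P$, the operator returns $\left(2tP + iP'\right)e^{-it^2/2}$, so only the polynomial prefactor needs to be tracked. The sketch below (function name hypothetical) does this with coefficient lists.

```python
def apply_t_plus_i_dt(poly):
    """Apply (t + i d/dt) to poly(t) * exp(-i t^2 / 2).

    poly is a coefficient list [c0, c1, ...] for c0 + c1 t + ...
    The result is the new polynomial prefactor 2 t P(t) + i P'(t).
    """
    out = [0j] * (len(poly) + 1)
    for k, c in enumerate(poly):
        out[k + 1] += 2 * c               # contribution of 2 t * (c t^k)
        if k >= 1:
            out[k - 1] += 1j * k * c      # contribution of i d/dt (c t^k)
    return out


first = apply_t_plus_i_dt([1])            # prefactor after one application
third = apply_t_plus_i_dt(apply_t_plus_i_dt(first))
print(first, third)
```

One application gives $2t$, so $q \to \sqrt{2}t$. Three applications give $8t^3 + 12it$, and dividing by $(\sqrt{2})^3$ yields $(\sqrt{2}t)^3 + 3i\sqrt{2}t$. The second term, multiplied by $i\epsilon c_1/3$, is real and so contributes to the amplitude rather than the phase.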

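A similar analysis can be carried out for the lower channel. Since our corrections
are written in the $p$ representation, we need to make the substitution

$$p = \frac{1}{\sqrt{2}}(t + \omega) \to \frac{1}{\sqrt{2}}(t - i\partial_t). \tag{4.127}$$

Therefore, with $S_2(p) = e^{i\epsilon c_2p^3/3}/\sqrt{1 + \epsilon b_2p}$ defined in analogy with (4.117), the WKB
mode in the lower channel is

$$\psi_2^{(\mathrm{WKB})}(t) = S_2\left(\frac{1}{\sqrt{2}}(t - i\partial_t)\right)\sqrt{2\pi}\,e^{-i\pi/4}e^{it^2/2} \tag{4.128}$$

$$= \sqrt{2\pi}\,e^{-i\pi/4}\left[1 - \frac{\epsilon b_2}{2}\frac{(t - i\partial_t)}{\sqrt{2}} + \frac{i\epsilon c_2}{3}\frac{(t - i\partial_t)^3}{2\sqrt{2}} + O(\epsilon^2)\right]e^{it^2/2} \tag{4.129}$$

$$= \sqrt{2\pi}\,e^{-i\pi/4}\left(1 - \frac{\epsilon b_2}{2}(\sqrt{2}t) + \frac{i\epsilon c_2}{3}\left((\sqrt{2}t)^3 - 3i\sqrt{2}t\right) + O(\epsilon^2)\right)e^{it^2/2} \tag{4.130}$$

$$\approx \sqrt{2\pi}\,e^{-i\pi/4}e^{it^2/2}\,\frac{e^{i\epsilon c_2(\sqrt{2}t)^3/3}}{\sqrt{1 + \epsilon(b_2 - 2c_2)\sqrt{2}t}}. \tag{4.131}$$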



We can now compare our analytical expressions (Equations (4.126) and (4.131))
and numerical simulations. As seen in Figure 4.4, these corrected solutions correspond
closely to the numerical simulations. The amplitudes of the analytical solutions
now contain the square root variation which is due to action conservation, so they
match the WKB solutions over a much wider range than before (cf. Figure 3.6).
The phases of the solutions also show good agreement with the numerical simulations.
Notice that Figure 4.4 plots the phases relative to the WKB phase, since
it changes rapidly compared with the variations due to the coupling and the higher
order terms. The calculation for the phase of the second channel is actually off by
a factor of $\pi/4$. This error is possibly related to the choice of phase in the
normalization of the metaplectic integral, or perhaps due to the choice of sign convention

FIG. 4.4: A comparison of the amplitudes and phases of the corrected local solutions
(Equations (4.126) and (4.131), black) with the amplitudes of a numerical simulation
of the original equations (red). The coupled WKB solutions are shown by the dashed
line. Notice how our new analytical solutions capture the slow amplitude variation due
to action conservation. Simulation parameters: ai = 3 x 10^'^, 5i = 7.4 x 10^"^, ci = 
1 X 10"'*, a2 = 3 X 10^^,62 = 7.4 x 10"^, C2 = -5 x 10""*. As discussed in the text, the 
phase of the second channel contains an unaccounted-for factor of $\pi/4$, which has been
corrected ad hoc in this plot.

for the plane waves. Whatever the cause, the difference has been removed ad hoc in
Figure 4.4, so that the smaller scale variations could be compared.



CHAPTER 5 



Summary 

In Part I of this dissertation, we gave a brief introduction to phase space methods
for scalar wave equations. In Chapter 2, we reviewed the connection between
ray-tracing techniques and the WKB approximation. This connection is most natural
when considered from the point of view of $(x, k)$ phase space. The Weyl symbol
of the wave operator is the dispersion function on phase space, and the zero sur- 
faces of the dispersion function define the dispersion relations for the problem being 
considered. The dispersion function can then be used as a ray hamiltonian, and a 
family of rays generated by this hamiltonian can be used to reconstruct the phase 
and amplitude of an approximate solution to the wave equation. 

In Chapter 3, we considered the problem of multicomponent wave equations.
Phase space techniques can also be used for multicomponent problems, as long as
the vector nature of the waves can be accounted for. Generically, the eigenvectors
of the dispersion matrix can be used as a polarization basis for the wave, and the
multicomponent problem reduces to several scalar wave problems. The eigenvalues
of the dispersion matrix then generate the rays for each of the different modes, and
the zeros of the eigenvalues give the dispersion relations for each mode. However,
the eigenvectors become nearly degenerate in mode conversion regions, and different
modes can interact, exchanging energy. We described this process by using an
example of two coupled oscillators with slowly varying natural frequencies. This
pair of oscillators was analyzed using the phase-space tools which were described
in the context of wave problems. When the frequencies of the oscillators are nearly
equal, the oscillators can resonantly exchange energy in a process analogous to mode
conversion. We described the matched asymptotic solution for the mode conversion
problem, and showed numerical simulations for the coupled oscillator example. Up
to this point the material presented has been for the purpose of review.

We then moved on in Chapter 4 to a new calculation. The matched asymptotic
techniques used to solve the mode conversion problem required the linearization of 
the dispersion matrix about the mode conversion point. We showed how to extend 
the local analysis of the mode conversion to include quadratic order terms in the 
dispersion matrix. This analysis gave us a new way to correct the local solution, 
which resulted in much better matching between the local solution and the far- 
field WKB solutions. In the process of calculating these corrections, we showed 
that, for a mode conversion in one spatial dimension, the dispersion matrix can 
be put into normal form through second order. This was done by a near-identity 
change of polarization basis, which made the off-diagonal terms constant (through 
second order). We also showed that the Moyal corrections introduced by this change 
of polarization basis enter at a higher order, and therefore, at the order we are 
considering, can be neglected. 

The phase space methods which were described and used here in Part I are
very useful. The phase space perspective can provide geometrical insights, which
can guide analytical calculations. Also, phase space ray-tracing algorithms can be
used to solve multicomponent wave problems which exhibit mode conversion, by
treating the mode conversion as a ray splitting. However, there are still problems
which cannot be solved using the approach described in the previous chapters. For
example, mode conversions where the dispersion surfaces meet tangentially, instead
of transversely, still need to be dealt with from a ray-tracing perspective. Also, wave
problems in multiple spatial dimensions can have rays with finite helicity, leading 
to new types of nonstandard mode conversions Sj . Problems such as these have led 
us to explore the mathematics which underlie the phase space analysis described 
in these chapters. In particular, we will examine in Part II of this dissertation the 
connection between the phase space path integral, the Weyl symbol, and the theory 
of the irreducible representations of the Heisenberg-Weyl group. These connections 
are best examined in the context of a new theory of symbols, which has recently 
been developed by Zobin, and which describes the symbol of an operator in terms 
of a double Fourier transform. In Part II, we describe this new theory of symbols, 
and show how it suggests several new avenues of future research. 



Part II 

The Group Theoretical
Foundations of Path Integrals

CHAPTER 6



Introduction 

In Part II of this dissertation, we will delve into the mathematical foundations
upon which the phase space techniques used in Part I are built. By doing so, the
richness of these foundations is shown, and new areas of research are opened up.

In Chapter 7, we review the theory of groups and their representations. This
groundwork, while well known to some mathematicians and physicists, is necessary
for the subsequent chapters, and worthwhile reviewing. In particular, we will
describe the regular representation, and how it reduces uniquely to primary
representations. Further reduction of the primary representations to the irreducible
representations will require a choice of basis. We will describe this reduction in
the case of the Heisenberg-Weyl group, and show how it is connected to choosing
a representation (e.g., $q$-representation or $p$-representation) for the wavefunction in
quantum mechanics.

Chapter 8 continues the review of group theory by describing the Fourier
transform in the context of groups. We describe the Fourier transform for
non-commutative groups, and show how it can be used to map operators (which are
embedded into sections of the dual bundle) to functions on the group. We then
discuss Zobin's theory of symbols, which defines the symbol of an operator as a
particular sequence of Fourier transforms. First, an inverse non-commutative Fourier
transform takes an operator to a function on the group, where the group is
considered as a set. Since the group (considered as a set) also has a commutative
group structure, there is also a commutative Fourier transform associated with the
group. This commutative transform is performed as the second step in calculating
the symbol of an operator. We then describe how the Weyl symbol is a special case
of the Zobin symbol for the continuous Heisenberg-Weyl group, and how it is
associated with the reduction of the regular representation to the primaries. We end the
chapter with the application of the Zobin symbol to the discrete Heisenberg-Weyl
group. This allows us to define the "symbol" of a matrix, which is a complex-valued
function on a (discrete) space. This novel type of symbol not only provides a new
way to analyze matrices, but also shows how the group theoretical approach to the
theory of symbols is widely applicable.

We next examine functions of operators in Chapter 9. We show that calculation
of the symbol of a function of an operator can be recast into the form of a
path integral. While the connection between Weyl symbols and path integrals has
been examined previously [21], we will analyze this connection in the more general
context of the Zobin symbol. This new context for the path integral leads to several
new insights. First, when calculating the discrete "path integral" for the discrete
Heisenberg-Weyl group, we see that there is a natural connection between the path
integral and sums over probability densities (or measures). This opens the door
to the use of techniques from statistical mechanics and information theory for the
analysis of path integrals, such as using maximum entropy calculations
for finding the most likely "path". The path integral for a continuous group can be
recast as an infinite-dimensional Fourier transform. In the discrete case, the sum
over all possible measures can be written in a way where it is evidently a
finite-dimensional Fourier transform. Taking the limit of this to get to the continuous group
turns the sum over measures into an integral over the space of measures, which then
looks like an infinite-dimensional Fourier transform on the space of measures. The
second insight into path integrals that is obtained from this group theory perspective
is the realization that the connection between the phase space path integral and
the configuration space path integral stems from the reduction of the regular
representation of the Heisenberg-Weyl group. Reduction of the regular representation
to the primary representations involves considering functions on phase space, rather
than functions on the whole group. The path integral then becomes an integral
over paths in phase space. The further reduction of the primary representations
to irreducible representations requires choosing a "configuration" subspace in phase
space, and only considering functions on configuration space. In this case, the path
integral is further restricted, and becomes an integral over paths in configuration
space.

We conclude Part II with a survey in Chapter 10 of several areas of research
opened up by this new group theoretical perspective on path integrals. First we
describe how to define a new normal form for the dispersion matrix for vector wave
problems. The diagonal elements are now associated with "uncoupled" modes, and
can be used as Hamiltonian functions for generating rays. This approach could
simplify the geometry of the dispersion surfaces for mode conversion problems, turning
"avoided crossings" into transverse intersections of the dispersion surfaces for the
interacting modes. A second idea suggested in Chapter 10 is a proposal to create a
"double symbol" for vector wave problems. The standard approach computes the
Weyl symbol of each element of the wave operator individually. This gives a
matrix-valued function on phase space. But, as shown in Chapter 8, the symbol of a matrix
can also be computed. This suggests combining the Weyl symbol with the symbol of
a matrix, to get a new sort of "double" symbol for the wave operator. The last idea
presented in Chapter 10 is related to dealing with uncertainties in mode conversion
problems. For example, turbulent fluctuations are often present in plasmas, and we
would like to be able to analyze mode conversion problems even in the presence of
such fluctuations. We examine the possibility of introducing decoherence into mode
conversion problems through an appropriate averaging of the Wigner function. This
procedure (which is done in the spirit of quantum decoherence of the density matrix)
smooths out the complicated interference patterns in the Wigner function, and
results in a much more "classical" distribution on phase space.



CHAPTER 7 

Review of the Theory of Linear 
Representations of Groups 

This chapter will provide a review of the theory of groups, and linear
representations of groups. Since it is intended to be a review, results will be presented
without proofs. Also, many general results will be stated for finite groups. Such
general results for infinite groups are difficult, if not impossible, to come by. However,
the infinite groups that we are interested in (such as the Heisenberg-Weyl group)
are nicely behaved, and the results obtained in the finite case can be shown to also
hold in these specific infinite cases. For more details and proofs, the textbooks by
Serre [22] and Kirillov [23] are good references.

The main point of this review is to describe the regular representation, and how 
it naturally and uniquely reduces to the primary representations. After a further 
(non-unique) choice of basis, this reduces to the irreducible representations. For 
the Heisenberg-Weyl group, this series of reductions will take us from functions 
on the group, to functions on phase space, and then to functions on configuration 
space (more precisely to any Lagrange plane in phase space). The choice of basis in 
this case corresponds to the choice of position and momentum coordinates in phase 
space. 




7.1 Groups and Group Actions 

A group is a set G endowed with an associative product rule, or group mul- 
tiplication, denoted here as o. There must be a neutral element in the group (the 
"identity" element), and each element in the group must have an inverse. 

A simple example of a group is the set of all integers, with addition as the
group product rule. Zero is the identity element of this group, since adding zero to
an integer gives the same integer back. The inverse of an integer $n$ is $-n$, so the
negative integers are the inverses of the positive integers.

Commutation 

An important characteristic of any given group is whether the group product
law is commutative. If the law is such that

$$g_1 \circ g_2 = g_2 \circ g_1, \qquad \forall\, g_1, g_2 \in G \tag{7.1}$$

then the group multiplication commutes, and the group is called commutative, or 
Abelian. Normal addition is commutative, so, for example, the set of real numbers 
forms a commutative group, using addition as the group "product". A non-Abelian, 
or noncommutative group has a group product law which is not commutative. This 
makes the group structure much richer, and gives many groups their interesting and 
useful properties. 

Subgroups 

A subgroup H of some group G is a subset of the elements of G which is a group 
under the multiplication inherited from G. It is a smaller group which is embedded 
in the structure of the larger group G. All products and inverse elements must be 
in the subset. 

$$h, k \in H \implies h \circ k,\ h^{-1},\ k^{-1} \in H \tag{7.2}$$

A particular subgroup which is of some importance is the so-called center of
the group. The center is the set of elements of the group which commute with any
other element of the group, and is denoted $Z(G)$.

$$Z(G) = \{k \in G \mid \forall g \in G,\ k \circ g = g \circ k\} \tag{7.3}$$

The identity element is definitely contained in the center, since multiplication from
the left or from the right by the identity leaves all elements unchanged. Also, direct
computation shows that if an element $h$ is in the center, then its inverse is also
in the center. Therefore, the center is a subgroup of $G$.
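As a concrete illustration, the center can be found by brute force for a small non-commutative group. The sketch below uses a finite analogue of the Heisenberg-Weyl group: triples $(a, b, c)$ of integers modulo $N$ with the product $(a, b, c) \circ (a', b', c') = (a + a',\ b + b',\ c + c' + ab')$. The modulus $N = 3$, this particular form of the product law, and the function names are assumptions chosen for the example, not definitions taken from the text.

```python
from itertools import product

N = 3  # modulus; an arbitrary small choice for illustration


def multiply(g, h):
    """(a, b, c) o (a', b', c') = (a + a', b + b', c + c' + a*b') mod N."""
    a, b, c = g
    a2, b2, c2 = h
    return ((a + a2) % N, (b + b2) % N, (c + c2 + a * b2) % N)


elements = list(product(range(N), repeat=3))

# The center: elements that commute with every element of the group.
center = [k for k in elements
          if all(multiply(k, g) == multiply(g, k) for g in elements)]

# The group is non-commutative: at least one pair fails to commute.
noncommuting = any(multiply(g, h) != multiply(h, g)
                   for g in elements for h in elements)
print(center, noncommuting)
```

In this example the center consists of the elements $(0, 0, c)$, which form a commutative subgroup, as expected from the argument above.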

Group Actions 

While groups are interesting as abstract entities, they become a powerful tool
for the study of physics through their actions. In order to properly define group
actions, it is necessary to first define a specific group: the group of invertible
transformations of a set $X$. An element of this group is an invertible mapping

$$\phi : X \to X, \tag{7.4}$$

and the product law in the group of transformations is simply the composition of
mappings.

$$\phi_1\phi_2 = \phi_1 \circ \phi_2 \tag{7.5}$$

With this definition of the group of transformations, a generic group action can now
be defined. A group action $A$ of the group $G$ is a mapping of $G$ into the group of
invertible transformations of some set $X$.

$$g \mapsto A_g, \quad \text{where } A_g : X \to X. \tag{7.6}$$

This can also be written as

$$A_g(x) = x', \qquad g \in G;\ x, x' \in X. \tag{7.7}$$

In order for the action to satisfy the group product law, we must have the following
property for the composition of actions.

$$A_{g_1 \circ g_2}(x) = A_{g_1}(A_{g_2}(x)) \tag{7.8}$$

An example of an action is the translations generated by the group of integers,
$G = \{\mathbb{Z}, +\}$. The translation of a function on the real line can be written

$$(T_nf)(x) = f(n + x). \tag{7.9}$$

This action satisfies the group product law, as can be seen by direct computation.

$$(T_{n+m}f)(x) = f(n + m + x) = (T_n(T_mf))(x) \tag{7.10}$$

So translation by integers is an action of the group $\{\mathbb{Z}, +\}$ on the set of functions
$L^2(\mathbb{R})$ on the real line.
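The composition property of the translation action can be seen in a few lines of code. This is a minimal sketch: the helper name `translate` and the test function are arbitrary choices for illustration.

```python
def translate(n, f):
    """Return T_n f, where (T_n f)(x) = f(n + x)."""
    return lambda x: f(n + x)


f = lambda x: x ** 3 - 2 * x     # arbitrary test function
n, m, x = 2, -5, 0.75            # arbitrary integers and sample point

lhs = translate(n + m, f)(x)                 # (T_{n+m} f)(x)
rhs = translate(n, translate(m, f))(x)       # (T_n (T_m f))(x)
print(lhs, rhs)
```

Both expressions evaluate $f$ at $n + m + x$, which is the content of the group product law for this action.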

7.2 Group Actions and Linear Representations 

Since group actions respect the group product law, they give us a way to study 
properties of the group. Because of the encoding of the product law in the action, 
the action provides us with a substitute for the group. The most general group 
actions are not necessarily the best objects to study, however, since in general the 
group action could be given by a nonlinear transformation. A simpler case would be 
an action which is a mapping into linear transformations. Such a mapping is called 
a linear representation of the group, or often just a representation of the group. A 
linear group representation $\rho$ is a mapping from the group to the space of operators
acting linearly on some vector space $V$:

$$\rho : G \to GL(V). \tag{7.11}$$

The operators $\rho(g)$ must satisfy the group product law in order to be a representation
of the group:

$$\rho(g_1 \circ g_2) v = \rho(g_1) \rho(g_2) v. \tag{7.12}$$

Although the notation is different, this is essentially the same as Equation (7.8),
which gives the essential property of a group action.

Group actions can take many forms. For example, the nonlinear transformation
given by the set of all possible linear fractional transformations which preserve the
upper half plane can be thought of as a nonlinear representation of the group of
$2 \times 2$ symplectic matrices. The linear fractional transformations map the complex
plane onto itself:

$$\sigma_i(z) = \frac{a_i z + b_i}{c_i z + d_i}. \tag{7.13}$$

In order to preserve the upper half plane, we require that the real parameters
$a_i$, $b_i$, $c_i$, and $d_i$ satisfy the property

$$a_i d_i - b_i c_i = 1. \tag{7.14}$$

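The composition of two linear fractional transformations is also a linear fractional
transformation, given by

$$\sigma_1(\sigma_2(z)) = \frac{a_3 z + b_3}{c_3 z + d_3}, \tag{7.15}$$

where

$$\begin{pmatrix} a_3 & b_3 \\ c_3 & d_3 \end{pmatrix} = \begin{pmatrix} a_1 & b_1 \\ c_1 & d_1 \end{pmatrix} \begin{pmatrix} a_2 & b_2 \\ c_2 & d_2 \end{pmatrix}. \tag{7.16}$$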

Note: if we flip the overall sign of these matrices, we get the same $\sigma(z)$; this
mapping from matrices to linear fractional transformations is 2 to 1.

Together, the properties in Equations (7.14) and (7.16) imply that the linear
fractional transformations (which preserve the UHP) are a representation of the
set of real $2 \times 2$ matrices with determinant one, modulo an overall sign, called
$PSL(2,\mathbb{R}) = SL(2,\mathbb{R})/\{\pm 1\}$.
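
The relationship between Equations (7.15) and (7.16) can be checked numerically.
The sketch below, in plain Python, composes two linear fractional transformations
and compares the result with the transformation built from the matrix product. The
helper names `mobius` and `matmul2` and the two sample matrices are assumptions
made for illustration.

```python
def mobius(matrix):
    """Linear fractional transformation z -> (a z + b) / (c z + d)."""
    (a, b), (c, d) = matrix
    return lambda z: (a * z + b) / (c * z + d)

def matmul2(m1, m2):
    """Product of two 2x2 matrices stored as nested tuples."""
    (a1, b1), (c1, d1) = m1
    (a2, b2), (c2, d2) = m2
    return ((a1 * a2 + b1 * c2, a1 * b2 + b1 * d2),
            (c1 * a2 + d1 * c2, c1 * b2 + d1 * d2))

# Two illustrative real matrices with determinant one.
m1 = ((2.0, 1.0), (1.0, 1.0))
m2 = ((1.0, 3.0), (0.0, 1.0))
m3 = matmul2(m1, m2)
minus_m3 = tuple(tuple(-x for x in row) for row in m3)

z = 0.3 + 1.7j                          # a point in the upper half plane
composed = mobius(m1)(mobius(m2)(z))    # sigma_1(sigma_2(z))
direct = mobius(m3)(z)                  # transformation built from M_1 M_2
flipped = mobius(minus_m3)(z)           # same map, built from -M_1 M_2

print(abs(composed - direct) < 1e-12,
      abs(flipped - direct) < 1e-12,
      direct.imag > 0)
```

The last two comparisons illustrate the 2-to-1 nature of the map from matrices to
transformations and the preservation of the upper half plane for this sample point.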
While nonlinear actions such as that in Equation (7.13) are perfectly valid as
group actions, it is often more convenient to use linear representations when studying 
a group. A representation p is said to have dimension n, where n is the dimension 
of the linear vector space V on which p acts. The dimension could be infinite, as is 
the case if V is an infinite-dimensional Hilbert space. 

For linear representations which act in $n$-dimensional ($n < \infty$) vector spaces,
the representation can be thought of as mapping elements of the group to $n \times n$
matrices. Here are some properties of such a representation:

• $V$ = vector space of dimension $n$

Elements of $V$ are vectors, or can be thought of as functions $f(x)$ on an
$n$-point lattice, where $x$ labels the points of the lattice.

• $GL(V)$ = group of automorphisms of $V$

$M \in GL(V)$ is an $n \times n$ matrix (an operator) acting on the space of functions
on a lattice.

• $\rho$ = a homomorphism of $G$ into $GL(V)$

$\rho(g)$ can be thought of as a matrix.

$\rho(g_1 \circ g_2) = \rho(g_1)\rho(g_2)$ is given by matrix multiplication.

Returning to the case of linear representations of arbitrary dimension, there are 
a few last notions worth discussing at this point. The first is the notion of unitary 
representations. If the vector space V on which the representation acts is a Hilbert 
space, that implies that there is a well defined notion of lengths of vectors in V . 
Operators which preserve the lengths of all vectors are called unitary operators, and 
a representation 

$$\rho : G \to U(V) \tag{7.17}$$

which maps the group to unitary operators is called a unitary representation. In this
work we are mostly interested in unitary representations. For the groups of interest
here, unitary representations always exist.

Finally, there is a notion of equivalent representations. If two representations
$\rho_1$ and $\rho_2$ are related by a constant unitary similarity transformation $U$,
they are said to be unitarily equivalent:

$$\rho_1(g) = U^{-1} \rho_2(g) U, \quad \forall g \in G \quad \Longrightarrow \quad \rho_1 \sim \rho_2. \tag{7.18}$$

Two unitarily equivalent representations have the same dimension. When discussing 
a representation of a group, we often really mean the equivalence class of the rep- 
resentation, since the particular choice of a representation in an equivalence class 
may not matter. In cases where the distinction between a particular representation 
and its equivalence class is important, we note it in the text. 

7.3 Reductions of Linear Representations 

Let $\rho : G \to GL(V)$ be a linear representation of the group $G$ acting on the
vector space $V$. For the moment, let $G$ be a finite group, and $\rho$ be a
finite-dimensional unitary representation, in order to simplify this discussion.
Although the proof is not simple, these results also apply to the infinite groups we
want to consider.

Let $W$ be a subspace of $V$. The subspace $W$ is invariant under the action of
the group if

$$\forall g \in G,\ w \in W, \quad \rho(g) w \in W. \tag{7.19}$$

If such an invariant subspace exists, then the restriction of $\rho$ to this subspace is
itself a representation of the group:

$$\rho|_W : G \to GL(W). \tag{7.20}$$

This is called a subrepresentation of $\rho$.

For the groups and representations that we are interested in, the invariant
subspace $W$ will have a complementary subspace $W^\perp$ which is also invariant. This
means that the representation $\rho$ can be put into block diagonal form, with the
subrepresentations $\rho|_W$ and $\rho|_{W^\perp}$ on the diagonal, and zeros in the
off-diagonal blocks:

$$\rho(g) = \begin{pmatrix} \rho|_W(g) & 0 \\ 0 & \rho|_{W^\perp}(g) \end{pmatrix}. \tag{7.21}$$

This breaking of $\rho$ into blocks can be continued if we can find invariant subspaces in
$W$ or $W^\perp$. If the space $V$ can be written as $V = W_1 \oplus W_2 \oplus \cdots$,
where all of the $W_i$ are invariant subspaces, then we get a block diagonalization
which reduces $\rho$ into a direct sum of representations acting in the smaller vector
spaces:

$$\rho(g) = \begin{pmatrix} \rho|_{W_1}(g) & & & \\ & \rho|_{W_2}(g) & & \\ & & \rho|_{W_3}(g) & \\ & & & \ddots \end{pmatrix}. \tag{7.22}$$

If a representation cannot be reduced any further, then it is called an irreducible
representation. The set of all (classes of unitarily equivalent) irreducible
representations is called the dual to the group $G$, and is denoted $\hat{G}$. For the
most interesting groups, any unitary representation of the group is made up of a
direct sum of irreducible representations.



7.4 The Regular Representation and Observations 
About its Reduction 



There are two important representations that can be formed for any group.
These are the right and left regular representations. These representations map the
elements of the group into shifts of functions on the group itself. Consider
$L^2(G, d\mu)$, the space of complex valued functions on the group,
$f \in L^2(G, d\mu)$, $f : G \to \mathbb{C}$. Here the measure $d\mu$ is the Haar measure,
which is a shift-invariant measure which will be discussed later. The right regular
representation maps group elements to right shifts of functions in $L^2(G, d\mu)$:

$$(\rho_R(h) f)(g) = f(g \circ h). \tag{7.23}$$

Some algebra shows that this is a representation of the group:

$$(\rho_R(h_1 \circ h_2) f)(g) = f(g \circ h_1 \circ h_2). \tag{7.24}$$

To show this is a representation, we need to compare this to the composition of
shifts:

$$(\rho_R(h_1) \circ \rho_R(h_2) f)(g) = (\rho_R(h_2) f)(g \circ h_1) \tag{7.25}$$

$$= f(g \circ h_1 \circ h_2). \tag{7.26}$$

So this is a representation. The regular representation works by shifting the elements 
of the group according to the group multiplication law. For a finite group, this can 
be pictured as a shuffling of the group elements in a way which is prescribed by the 
multiplication table for the group. Each matrix of the regular representation pR{g) 
is then just a big permutation matrix, and knowing the set of all of these matrices 
for each $g \in G$ would allow you to reconstruct the multiplication table.
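
To make the permutation-matrix picture concrete, here is a minimal Python sketch
that builds the right regular representation of a small finite group and checks the
homomorphism property of Equations (7.24)-(7.26). The choice of the permutation
group $S_3$ as the example, and all function names, are assumptions made for
illustration.

```python
from itertools import permutations

# S_3 realized as permutations of (0, 1, 2); an illustrative finite group.
elements = list(permutations(range(3)))
index = {g: i for i, g in enumerate(elements)}

def compose(g, h):
    """Group product g o h, meaning (g o h)(i) = g(h(i))."""
    return tuple(g[h[i]] for i in range(3))

def rho_R(h):
    """Matrix of the right shift: (rho_R(h) f)(g) = f(g o h)."""
    size = len(elements)
    matrix = [[0] * size for _ in range(size)]
    for g in elements:
        matrix[index[g]][index[compose(g, h)]] = 1
    return matrix

def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]

def is_permutation_matrix(m):
    """Exactly one 1 in every row and every column, zeros elsewhere."""
    one_hot = [0] * (len(m) - 1) + [1]
    rows_ok = all(sorted(row) == one_hot for row in m)
    cols_ok = all(sorted(col) == one_hot for col in zip(*m))
    return rows_ok and cols_ok

is_homomorphism = all(
    rho_R(compose(h1, h2)) == matmul(rho_R(h1), rho_R(h2))
    for h1 in elements for h2 in elements
)
all_permutations = all(is_permutation_matrix(rho_R(h)) for h in elements)
print(is_homomorphism, all_permutations)
```

Because every matrix is a permutation matrix, each one is orthogonal, which is the
finite-group version of the unitarity of the regular representation.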

We can also form a representation using left shifts, which is called the left 
regular representation. 

$$(\rho_L(h) f)(g) = f(h^{-1} \circ g). \tag{7.27}$$

Again we can check that this is a representation:

$$(\rho_L(h_1 \circ h_2) f)(g) = f((h_1 \circ h_2)^{-1} \circ g) \tag{7.28}$$

$$= f(h_2^{-1} \circ h_1^{-1} \circ g). \tag{7.29}$$

The composition of shifts is given by

$$(\rho_L(h_1) \circ \rho_L(h_2) f)(g) = (\rho_L(h_2) f)(h_1^{-1} \circ g) \tag{7.30}$$

$$= f(h_2^{-1} \circ h_1^{-1} \circ g). \tag{7.31}$$

These two regular representations are important because they can be constructed 
for any group. 

For the types of groups we want to consider, the space of functions has a measure 
which is invariant under left or right shifts. 

$$d(g \circ h) = dg, \quad \text{or} \quad d(h^{-1} \circ g) = dg. \tag{7.32}$$

While the left invariant measure is not in general the same as the right invariant 
measure, it is the same for the groups we want to consider. The unique (up to a 
constant) invariant measure on the group is called the Haar measure for the group. 
Given the Haar measure, the notion of an L2 norm can be defined. With this norm, 
the regular representations are unitary representations. 

$$\int_G |(\rho_R(h) f)(g)|^2 \, d\mu(g) = \int_G |f(g \circ h)|^2 \, d\mu(g) \tag{7.33}$$

$$= \int_G |f(g')|^2 \, d\mu(g' \circ h^{-1}) \tag{7.34}$$

$$= \int_G |f(g')|^2 \, d\mu(g'). \tag{7.35}$$

This substitution is possible because the measure is invariant under the shift
$g' = g \circ h$. A similar calculation shows that the left regular representation is
also unitary.

The regular representations are nice for another reason. Even though the regular
representations are not irreducible, they can be reduced, and in fact are made up
of a direct sum of copies of all of the irreducible representations of the group. So,
for the groups that we are interested in, one way to try to find all of the irreducible
representations would be to start with the regular representation and reduce it as
far as possible.

One way that the reduction of the regular representation could be achieved
would be to consider the commutative subgroups $H$ of the group $G$. Irreducible
representations of these subgroups are all one dimensional, and so are
straightforward to find. Consider the following series of observations¹ about the
regular representation and the subgroups of $G$.

Observation: Consider the following closed subspaces of $L^2(G)$. Choose any
commutative subgroup $H < G$, and any one dimensional irreducible representation
$\tau$ of $H$. Define the space of covariant functions as

$$S(H, \tau) = \{ f : \forall h \in H,\ \forall g \in G,\ f(g \circ h) = \tau(h) f(g) \}. \tag{7.36}$$

If we use the subgroup $H$ to define equivalence classes in $G$, then we see that all
functions $f \in S(H, \tau)$ can be described by their values on a smaller set of elements
in $G$. If we pick one element $k$ from each equivalence class in $G/H$ (this is a set of
representatives for the classes), then $f$ is determined by its values on the $k$'s:

$$\forall g = k \circ h, \quad f(g) = f(k \circ h) = \tau(h) f(k). \tag{7.37}$$

This means that the space $S(H, \tau)$ is smaller than the original function space
$L^2(G)$. The dimension of $S(H, \tau)$ is equal to the number of equivalence classes in
$G/H$.

Observation: The subspace $S(H, \tau)$ is invariant under the action of the left
regular representation:

$$(\rho_L(g_2) f)(g \circ h) = f(g_2^{-1} \circ g \circ h) \tag{7.38}$$

$$= \tau(h) f(g_2^{-1} \circ g) \tag{7.39}$$

$$= \tau(h) (\rho_L(g_2) f)(g). \tag{7.40}$$



Since this space is a $\rho_L$-invariant subspace, it can be used to reduce the left
regular representation.

¹As is the case with most of my knowledge of group theory, I am indebted to Prof. Zobin for
this series of observations, which he outlined for me.

Observation: Larger commutative subgroups will reduce the regular representation
more. If $H_2 > H_1$, then $S(H_2, \tau) \subset S(H_1, \tau)$. This follows from the fact
that the covariance condition in the definition of $S(H_2, \tau)$ will impose more
constraints, and so fewer functions will satisfy the condition.

Observation: Choose $H$ to be a maximal commutative subgroup. Then
$S(H, \tau)$ will be minimal. Very often (i.e., for all of the interesting groups) the
restriction $\rho_L|_{S(H,\tau)}$ of the regular representation to this subspace will be an
irreducible representation. Also very often, the restrictions for all maximal
commutative $H$ and all irreducible representations $\tau$ will give you all of the
irreducible representations of $G$.

Observation: Some of these restrictions will give unitarily equivalent
representations:

$$\rho_L|_{S(H,\tau)} \sim \rho_L|_{S(H',\tau')} \quad \text{for some } (H, \tau),\ (H', \tau'). \tag{7.41}$$

For Lie groups, the Kirillov orbit method [23] can be used to distinguish and label
the various inequivalent irreducible representations. For the Heisenberg-Weyl group,
we can use the Stone-von Neumann theorem to check whether two irreducible
representations are unitarily equivalent, as will be discussed.



7.5 Primary and Irreducible Representations 

While the irreducible representations are in some sense the most fundamental 
representations of a group, when decomposing any given representation it is often 
more natural to work with primary representations. In this section, we will de- 
scribe how to reduce the regular representation of a group, and introduce primary 
representations in this context. The reduction of the regular representation can
proceed unambiguously to the level of the primaries. This reduction is called the
canonical decomposition [22]. Each primary representation in turn is a direct sum
of several copies of some particular irreducible representation. Since each primary 
carries with it information about only one irreducible representation, the primary 
representations provide a nice decomposition. However, since they are made up of 
several copies of the irreducible representation, there is still some redundancy. At 
this point, it is necessary to introduce an arbitrary block basis into the vector space 
on which the primaries act in order to split the primary representations into irre- 
ducibles. In some sense, the primaries are therefore more natural to work with than 
the irreducibles, since the irreducibles require this arbitrary choice of basis. 

As will be discussed in later sections, the distinction between the primary repre- 
sentations and the irreducible representations for the Heisenberg-Weyl group roughly 
becomes a distinction between considering functions on classical phase space and 
functions on configuration space. The choice of coordinates for configuration space 
is arbitrary, since a canonical transformation will take us from one set of coordi- 
nates to another. Similarly, the reduction to irreducible representations from the 
primaries requires a choice of basis, or "coordinates". This choice of coordinates 
splits the phase space into "positions" and "momenta". 

The Regular Representation 

The regular representation of a group arises when one considers the space of
complex valued functions defined on the group itself, $f : G \to \mathbb{C}$. Since the
group is a set, or even, as is the case for Lie groups, a manifold, it is natural to think
of functions which map this set to the complex numbers. The space of all such
functions forms a linear vector space, with addition of the vectors being defined as
the point-wise addition of the complex values of the corresponding functions:

$$(f_1 + f_2)(g) = f_1(g) + f_2(g). \tag{7.42}$$

This vector space, $\mathcal{F}_G$, is used to form the (left) regular representation of the
group through the definition

$$(\rho_L(h) f)(g) = f(h^{-1} \circ g). \tag{7.43}$$

The computation in the previous section shows that this is a representation.

While the regular representation is perhaps the most natural representation 
of a group, it is not an irreducible representation. In fact, for the groups we are 
considering, it contains copies of all of the irreducible representations of the group. 
In order to reduce the regular representation, we need a way to find all of the
invariant subspaces of $\mathcal{F}_G$. This could be achieved by defining an arbitrary basis
set for $\mathcal{F}_G$, and then writing the decomposition in terms of that basis. However, a less
arbitrary way to perform the decomposition could be achieved if we can define pro- 
jection operators, each of which projects onto an invariant subspace. It turns out 
that these projection operators can be found, but they do not project onto the sub- 
spaces associated with the irreducible representations, but rather those associated 
with the primaries. Unlike the decomposition into irreducible representations, the 
decomposition into primaries is unique. 

Decomposition of the Regular Representation 

Let the irreducible representations of $G$ be $\rho_i : G \to GL(V_i)$. For the
calculations in this section, we let $G$ be a finite group. The results also hold for the
infinite groups that we want to study, but the derivations are much more complicated.

Assuming that the regular representation is a direct sum of irreducible
representations, we can decompose the space $\mathcal{F}_G$ into a direct sum of the spaces
$V_i$:

$$\mathcal{F}_G = \bigoplus_k V_k. \tag{7.44}$$

Since each irreducible representation may appear in $\rho_L$ more than once, there may
be several copies of each $V_i$ in $\mathcal{F}_G$. Let $m_i$ be the multiplicity of $\rho_i$ in
$\rho_L$. Then define the space $W_i$ as

$$W_i = m_i V_i = \underbrace{V_i \oplus V_i \oplus \cdots \oplus V_i}_{m_i \text{ times}}. \tag{7.45}$$

This is the space on which the primary representation acts. Define the primary
representation $\bar{\rho}_i$ as

$$\bar{\rho}_i = \bigoplus_{k=1}^{m_i} \rho_i. \tag{7.46}$$

This is a representation of the group, $\bar{\rho}_i : G \to GL(W_i)$. Each primary
representation contains $m_i$ copies of the irreducible representation $\rho_i$. While the
primary is a reducible representation, its reduction to irreducibles is not unique. Its
reduction requires a choice of basis for the decomposition of $W_i$, as we'll show.

We can then write the decomposition of the space $\mathcal{F}_G$ uniquely as

$$\mathcal{F}_G = \bigoplus_i W_i. \tag{7.47}$$

The regular representation then decomposes into primaries:

$$\rho_L = \bigoplus_i \bar{\rho}_i. \tag{7.48}$$

This reduction is easier to understand if we write the matrix associated with the
representations. With the proper arrangement of basis functions, the decomposition
of the regular representation into primaries puts the primary representations
$\bar{\rho}_i$ into blocks on the diagonal:

$$\rho_L = \begin{pmatrix} \bar{\rho}_1 & & & \\ & \bar{\rho}_2 & & \\ & & \bar{\rho}_3 & \\ & & & \ddots \end{pmatrix}. \tag{7.49}$$

Each of the primaries acts separately on its subspace $W_i$. The primaries could be
reduced into block diagonal form, with $m_i$ copies of the irreducible representation
$\rho_i$ on the diagonal, by choosing some block basis:

$$\bar{\rho}_i = \begin{pmatrix} \rho_i & & & \\ & \rho_i & & \\ & & \rho_i & \\ & & & \ddots \end{pmatrix}. \tag{7.50}$$

The block basis does not require the choice of the entire basis, just one basis vector
in each of the subspaces in $W_i$ on which the irreducibles act. The rest of the basis
vectors are then generated by the action of the representation on each of the chosen
vectors.

These two reductions ($\rho_L \to \bar{\rho}_i$ and $\bar{\rho}_i \to \rho_i$) can both be
performed for a general group. Perhaps the clearest way to do this is using projection
operators, as in [22].



From Regular to Primaries 

To construct the projection operator, we need to first define the character of
a representation. The character of a representation is a function on $G$ defined for
some representation $\rho$ as

$$\chi_\rho(g) = \mathrm{Tr}[\rho(g)]. \tag{7.51}$$

This function has many nice properties [22]. For example, the characters of
irreducible representations satisfy an orthogonality relation as functions in
$\mathcal{F}_G$, and therefore "characterize" a representation, in some sense. The scalar
product in $\mathcal{F}_G$ is defined as

$$\langle f_1 | f_2 \rangle = \frac{1}{N_G} \sum_{g \in G} f_1^*(g) f_2(g), \tag{7.52}$$

where $N_G$ is the order of the group $G$. In the case of finite groups, the order is
simply the number of elements in the group. With this product, the orthogonality
of the characters of the irreducible representations can be written

$$\langle \chi_i | \chi_j \rangle = \delta_{ij}, \tag{7.53}$$

where $\chi_i$ and $\chi_j$ are the characters of the irreducible representations $\rho_i$ and
$\rho_j$, and $\delta_{ij}$ is the Kronecker delta.



As described in Section 2.6 of [22], we can use the characters of the irreducible
representations to construct the projection operator

$$P_i = \frac{n_i}{N_G} \sum_{g \in G} \chi_i^*(g) \rho_L(g), \tag{7.54}$$

where $\chi_i(g)$ is the character of the irreducible representation $\rho_i$, and $n_i$ is
the dimension of $\rho_i$. Consider the action of this operator on an element $f$ of one of
the subspaces $W_j$, i.e., $f$ has components only in $W_j$. Then, since $W_j$ is
invariant under the action of the regular representation, which acts on $W_j$ as the
primary representation $\bar{\rho}_j$, we get

$$P_i f = \frac{n_i}{N_G} \sum_{g \in G} \chi_i^*(g) \rho_L(g) f \tag{7.55}$$

$$= \frac{n_i}{N_G} \sum_{g \in G} \chi_i^*(g) \bar{\rho}_j(g) f. \tag{7.56}$$

Since $\rho_j$ is an irreducible representation, we can use Schur's Lemma to rewrite this
sum in terms of the inner product in Equation (7.52) to obtain

$$P_i f = \frac{n_i}{N_G} \frac{N_G}{n_j} \langle \chi_i | \chi_j \rangle f = \delta_{ij} f. \tag{7.57}$$

So, the operator $P_i$ is a projection operator. If the element $f$ lies in the subspace
$W_i$, then $P_i$ leaves it unchanged, but if $f$ has no component in $W_i$, then it is sent
to zero by $P_i$. This projection operator lets us unambiguously define the subspaces on
which the primary representations act. So we have decomposed the regular
representation into primary representations, and this decomposition is unique.
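
The projection operators of Equation (7.54) can be built explicitly for a small
group. The following Python sketch does this for the permutation group $S_3$, using
exact rational arithmetic. The choice of $S_3$, its three standard characters
(trivial, sign, and the two dimensional "standard" representation), and all of the
function names are assumptions made for illustration; they are not taken from the
text.

```python
from fractions import Fraction
from itertools import permutations

G = list(permutations(range(3)))           # the group S_3
index = {g: i for i, g in enumerate(G)}
N_G = len(G)

def compose(g, h):
    """Group product g o h, meaning (g o h)(i) = g(h(i))."""
    return tuple(g[h[i]] for i in range(3))

def rho_L(h):
    """Matrix of the left shift: (rho_L(h) f)(g) = f(h^{-1} o g)."""
    m = [[Fraction(0)] * N_G for _ in range(N_G)]
    for x in G:
        m[index[compose(h, x)]][index[x]] = Fraction(1)
    return m

def sign(g):
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if g[i] > g[j])
    return -1 if inversions % 2 else 1

def fixed_points(g):
    return sum(1 for i in range(3) if g[i] == i)

# (dimension n_i, character chi_i) for the three irreducibles of S_3.
irreducibles = {
    "trivial": (1, lambda g: 1),
    "sign": (1, sign),
    "standard": (2, lambda g: fixed_points(g) - 1),
}

def projector(n_i, chi):
    """P_i = (n_i / N_G) * sum_g conj(chi_i(g)) rho_L(g); chi_i is real here."""
    p = [[Fraction(0)] * N_G for _ in range(N_G)]
    for g in G:
        weight = Fraction(n_i * chi(g), N_G)
        shift = rho_L(g)
        for r in range(N_G):
            for c in range(N_G):
                p[r][c] += weight * shift[r][c]
    return p

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(N_G)) for j in range(N_G)]
            for i in range(N_G)]

def trace(m):
    return sum(m[i][i] for i in range(N_G))

projectors = {name: projector(n, chi) for name, (n, chi) in irreducibles.items()}
ranks = {name: trace(p) for name, p in projectors.items()}
print(ranks)
```

The trace of each projector gives the dimension of the corresponding subspace
$W_i$. For the regular representation of a finite group each irreducible appears
with multiplicity equal to its dimension, so these dimensions are $n_i^2$ and add
up to the order of the group.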
From Primaries to Irreducibles

The further reduction of the primary representations $\bar{\rho}_i$ into irreducible
representations requires an arbitrary choice of a block basis in the spaces $W_i$.
Instead of a complete discussion of the most general case, it is perhaps better to
illustrate the procedure with a simple example. (For another example, see Section
7.7.1, where the reduction of the regular representation of the finite Heisenberg-Weyl
group is carried out in detail.)

Consider the group with only one element. This element must be the identity
element, and the group product always maps back to this element: $G = \{e\}$,
$e \circ e = e$. This group has one unitary irreducible representation, which is
multiplication by 1:

$$\rho : G \to GL(\mathbb{R}), \quad \rho(e) z = \mathrm{id}\, z = z. \tag{7.58}$$

Consider the reducible representation $\bar{\rho}$ formed by the direct sum of two copies
of $\rho$. This is the identity operator on a 2-dimensional vector space:

$$\bar{\rho} : G \to GL(\mathbb{R}^2). \tag{7.59}$$

We can decompose the action of $\bar{\rho}$ into its action on one dimensional subspaces
simply by choosing two linearly independent vectors $(u, v)$ in the plane. Scaling these
vectors will form independent subspaces $U = \{\lambda u \mid \lambda \in \mathbb{R}\}$ and
$V = \{\lambda v \mid \lambda \in \mathbb{R}\}$. Then, by restricting the action of $\bar{\rho}$
to these spaces, we can write the decomposition

$$\bar{\rho} = \bar{\rho}|_U \oplus \bar{\rho}|_V. \tag{7.60}$$

Each of the restrictions $\bar{\rho}|_U$ and $\bar{\rho}|_V$ is equivalent to the irreducible
representation $\rho$, so we have reduced the primary representation to irreducible
representations. However, the reduction required the choice of the vectors $u$ and $v$,
so this reduction is not unique.



7.6 The Stone-von Neumann Theorem

The essence of the Stone-von Neumann theorem is this: the multidimensional
irreducible representations of the Heisenberg-Weyl group can be labeled by their
action on the center $Z(\mathfrak{H})$.

Since elements of the center commute with all others, any representation of 
the center must also commute with all other elements of the representation. This 
means that unitary representations of the center are given by a phase times the 
identity operator. The Stone-von Neumann theorem says that this phase identifies 
the irreducible representation up to a unitary transformation. Furthermore, the 
unitary transformation which relates the equivalent representations is induced by a 
symplectic group automorphism. 

Any group automorphism will take an irreducible representation into another 
irreducible representation. In general, the new irreducible representation will not be 
unitarily equivalent to the original representation. We can ask what conditions are 
required such that the new and old representations will be unitarily equivalent. The
Stone-von Neumann theorem answers this question for the Heisenberg-Weyl group. 

There are two types of automorphisms of the Heisenberg-Weyl group. A
symplectic transformation is an automorphism of $\mathfrak{H}$ which is the identity
mapping for elements of the center $Z(\mathfrak{H})$. The other type of automorphism of
$\mathfrak{H}$ is a group dilation. The Stone-von Neumann theorem tells us how these
automorphisms are related to the irreducible representations. Symplectic
transformations take an irreducible representation into an equivalent representation:

$$\rho \xrightarrow{\ \mathrm{Sym}\ } \rho' \sim \rho. \tag{7.61}$$

However, group dilations take the representation into an inequivalent representation:

$$\rho \xrightarrow{\ \mathrm{Dilation}\ } \rho' \nsim \rho. \tag{7.62}$$

According to the Stone-von Neumann theorem, all inequivalent irreducible
representations can be generated from one irreducible representation by application
of the group dilations.

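We can write these relationships in a diagram. Here $\mathfrak{H}$ is the Heisenberg-Weyl
group, and $U(\mathcal{H}_\rho)$ is the set of unitary operators on the Hilbert space
$\mathcal{H}_\rho$ associated with the irreducible representation $\rho$.

$$\begin{array}{ccc} \mathfrak{H} & \xrightarrow{\ \rho\ } & U(\mathcal{H}_\rho) \\ \downarrow \mathrm{Sym} & & \downarrow \mathrm{Met} \\ \mathfrak{H} & \xrightarrow{\ \rho'\ } & U(\mathcal{H}_\rho) \end{array} \tag{7.63}$$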

The unitary transformation which relates the old representation p with the new 
representation p' is called a metaplectic transformation. 

A symplectic automorphism can be thought of as a relabeling of the group 
elements in the phase space component of the group, where the relabeling is done in a 
way consistent with the group product law. A linear canonical transformation of the 
phase space components is such a transformation, since it preserves the symplectic 
product $\omega(z_1, z_2)$.

These ideas can be illustrated by specific examples. In the following sections, 
we will show how the Fourier transform arises as a consequence of the Stone-von 
Neumann theorem, and how the metaplectic transformations can be thought of as 
a generalization of the Fourier transform. 



7.7 Examples 

7.7.1 The Discrete Heisenberg-Weyl Group 

Definition of the group 

The discrete Heisenberg group is made up of elements that each have three
components. Two are "phase space" components $z = (q,p)$ and the other is a phase
$\lambda$:

$$\mathfrak{H}_n : \{ g = (z, \lambda) \mid z = (q,p) \in \mathbb{Z}_n \times \mathbb{Z}_n;\ \lambda \in \mathbb{Z}_n \}. \tag{7.64}$$

For concreteness, we will use $\mathfrak{H}_3$ to give explicit examples and to draw
pictures. Results are quoted for general $\mathfrak{H}_n$ when the generalization is
straightforward, as it often is. Figure 7.1 shows the points in the group embedded in
space, for the case where $n = 3$. The group multiplication law for this group is
defined as

$$g_1 \circ g_2 = (z_1 + z_2,\ \lambda_1 + \lambda_2 + \omega(z_1, z_2)) \mod n, \tag{7.65}$$

where $\omega$ is the symplectic product

$$\omega(z_1, z_2) = q_1 p_2 - q_2 p_1 = z_1 \cdot J \cdot z_2. \tag{7.66}$$

The phase component $\lambda$ of the group product records the symplectic product of the
phase space components, and makes this a noncommutative group. Note that if
$n = 2$, this reduces to a commutative product law, which is why we use $n = 3$ and
not $n = 2$ in our examples.
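
The group law of Equations (7.65) and (7.66) is easy to check by brute force. The
Python sketch below enumerates the group for a given $n$ and tests associativity
and commutativity; the function names are illustrative assumptions.

```python
from itertools import product

def omega(z1, z2):
    """Symplectic product of two phase space points, as in (7.66)."""
    (q1, p1), (q2, p2) = z1, z2
    return q1 * p2 - q2 * p1

def multiply(g1, g2, n):
    """Group law (7.65), with every component reduced mod n."""
    (q1, p1, lam1), (q2, p2, lam2) = g1, g2
    return ((q1 + q2) % n,
            (p1 + p2) % n,
            (lam1 + lam2 + omega((q1, p1), (q2, p2))) % n)

def elements(n):
    """All triples (q, p, lambda) with entries in Z_n."""
    return list(product(range(n), repeat=3))

def is_associative(n):
    G = elements(n)
    return all(multiply(multiply(a, b, n), c, n) == multiply(a, multiply(b, c, n), n)
               for a in G for b in G for c in G)

def is_commutative(n):
    G = elements(n)
    return all(multiply(a, b, n) == multiply(b, a, n) for a in G for b in G)

print(len(elements(3)), is_associative(3), is_commutative(3), is_commutative(2))
# prints: 27 True False True
```

The two orderings of a product differ in the phase component by
$2\omega(z_1, z_2)$, which vanishes mod 2; this is why the $n = 2$ case comes out
commutative.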

Now, for $n = 3$, there are nine possibilities for $z = (q,p)$, and three for $\lambda$. We
have the following list of 27 group elements.

FIG. 7.1: The elements of the discrete Heisenberg-Weyl group embedded as points in
space. The shaded plane at $\lambda = 0$ is the phase space, with elements $g = (z, 0)$.

(7.67)

This listing of the group elements can be used in a lexicographical way to
write the multiplication table for the group. The list gives labels to each element,
1 = (0, 0, 0), 2 = (1, 0, 0), 3 = (2, 0, 0), .... A multiplication table for the group lists
the group elements in a row and column, and then fills in the table with the group
element obtained by multiplying the row and column elements.



$$\begin{array}{c|ccc} \circ & e & a & b \\ \hline e & e & a & b \\ a & a & & \\ b & b & & \end{array} \tag{7.68}$$

Since there are 27 elements, the order of the group is $N_{\mathfrak{H}_3} = 3^3 = 27$. If we
apply the multiplication rule to these elements, then we get the following
multiplication table.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27

2 3 1 14 15 13 26 27 25 11 12 10 23 24 22 8 9 7 20 21 19 5 6 4 17 18 16 

3 1 2 24 22 23 18 16 17 12 10 11 6 4 5 27 25 26 21 19 20 15 13 14 9 7 8 

4 23 15 7 26 18 1 20 12 13 5 24 16 8 27 10 2 21 22 14 6 25 17 9 19 11 3 

5 24 13 17 9 25 20 12 1 14 6 22 26 18 7 2 21 10 23 15 4 8 27 16 11 3 19 

6 22 14 27 16 8 12 1 20 15 4 23 9 25 17 21 10 2 24 13 5 18 7 26 3 19 11 

7 17 27 1 11 21 4 14 24 16 26 9 10 20 3 13 23 6 25 8 18 19 2 12 22 5 15 

8 18 25 11 21 1 23 6 13 17 27 7 20 3 10 5 15 22 26 9 16 2 12 19 14 24 4 

9 16 26 21 1 11 15 22 5 18 25 8 3 10 20 24 4 14 27 7 17 12 19 2 6 13 23 

10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 1 2 3 4 5 6 7 8 9 

11 12 10 23 24 22 8 9 7 20 21 19 5 6 4 17 18 16 2 3 1 14 15 13 26 27 25 

12 10 11 6 4 5 27 25 26 21 19 20 15 13 14 9 7 8 3 1 2 24 22 23 18 16 17 

13 5 24 16 8 27 10 2 21 22 14 6 25 17 9 19 11 3 4 23 15 7 26 18 1 20 12 

14 6 22 26 18 7 2 21 10 23 15 4 8 27 16 11 3 19 5 24 13 17 9 25 20 12 1 

15 4 23 9 25 17 21 10 2 24 13 5 18 7 26 3 19 11 6 22 14 27 16 8 12 1 20 

16 26 9 10 20 3 13 23 6 25 8 18 19 2 12 22 5 15 7 17 27 1 11 21 4 14 24 

17 27 7 20 3 10 5 15 22 26 9 16 2 12 19 14 24 4 8 18 25 11 21 1 23 6 13 

18 25 8 3 10 20 24 4 14 27 7 17 12 19 2 6 13 23 9 16 26 21 1 11 15 22 5 

19 20 21 22 23 24 25 26 27 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 

20 21 19 5 6 4 17 18 16 2 3 1 14 15 13 26 27 25 11 12 10 23 24 22 8 9 7 

21 19 20 15 13 14 9 7 8 3 1 2 24 22 23 18 16 17 12 10 11 6 4 5 27 25 26 

22 14 6 25 17 9 19 11 3 4 23 15 7 26 18 1 20 12 13 5 24 16 8 27 10 2 21 

23 15 4 8 27 16 11 3 19 5 24 13 17 9 25 20 12 1 14 6 22 26 18 7 2 21 10 

24 13 5 18 7 26 3 19 11 6 22 14 27 16 8 12 1 20 15 4 23 9 25 17 21 10 2 

25 8 18 19 2 12 22 5 15 7 17 27 1 11 21 4 14 24 16 26 9 10 20 3 13 23 6 

26 9 16 2 12 19 14 24 4 8 18 25 11 21 1 23 6 13 17 27 7 20 3 10 5 15 22 
27 7 17 12 19 2 6 13 23 9 16 26 21 1 11 15 22 5 18 25 8 3 10 20 24 4 14

(7.69) 

An important feature of this table is that it is not symmetric, which reflects 
the fact that the group is not commutative (non-abelian). 
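
The multiplication table can be regenerated from the group law. The Python sketch
below does this under one assumption that should be stated loudly: the labeling
beyond the three labels quoted in the text (1 = (0,0,0), 2 = (1,0,0),
3 = (2,0,0)) is taken to be label = 1 + q + 3p + 9λ, i.e., $q$ runs fastest, then
$p$, then $\lambda$. The function names are likewise illustrative. The second row of
Equation (7.69) is copied in for comparison.

```python
from itertools import product

n = 3

def multiply(g1, g2):
    """Group law (7.65) for the discrete Heisenberg-Weyl group with n = 3."""
    (q1, p1, lam1), (q2, p2, lam2) = g1, g2
    return ((q1 + q2) % n,
            (p1 + p2) % n,
            (lam1 + lam2 + q1 * p2 - q2 * p1) % n)

def label(g):
    """Assumed labeling: q runs fastest, then p, then lambda."""
    q, p, lam = g
    return 1 + q + n * p + n * n * lam

ordered = sorted(product(range(n), repeat=3), key=label)

# Entry [row][col] is the label of (row element) o (column element).
table = [[label(multiply(row, col)) for col in ordered] for row in ordered]

# Second row of the table in Equation (7.69), copied from the text.
row_2_from_text = [2, 3, 1, 14, 15, 13, 26, 27, 25, 11, 12, 10, 23, 24, 22,
                   8, 9, 7, 20, 21, 19, 5, 6, 4, 17, 18, 16]

matches_text = table[1] == row_2_from_text
is_symmetric = all(table[i][j] == table[j][i]
                   for i in range(len(table)) for j in range(len(table)))
print(matches_text, is_symmetric)
```

Under this labeling the generated second row agrees with the table, and the
asymmetry shows up already in the entries for 2 o 4 and 4 o 2.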



Commutative Subgroups 

This group has at least two types of commutative subgroups, the cyclic
subgroups, and the maximal commutative subgroups. The commutative cyclic
subgroups are generated by powers of a given element of the group:

$$S_g = \{ g^j = \underbrace{g \circ g \circ \cdots \circ g}_{j \text{ times}} \mid j = 1, 2, \ldots, n \}. \tag{7.70}$$

Because of the way in which it was constructed, this is a commutative subgroup:

$$g^{j_1} \circ g^{j_2} = \underbrace{g \circ g \circ \cdots \circ g}_{j_1 \text{ times}} \circ \underbrace{g \circ g \circ \cdots \circ g}_{j_2 \text{ times}} \tag{7.71}$$

$$= \underbrace{g \circ g \circ \cdots \circ g}_{j_1 + j_2 \text{ times}} \tag{7.72}$$

$$= \underbrace{g \circ g \circ \cdots \circ g}_{j_2 + j_1 \text{ times}} \tag{7.73}$$

$$= \underbrace{g \circ g \circ \cdots \circ g}_{j_2 \text{ times}} \circ \underbrace{g \circ g \circ \cdots \circ g}_{j_1 \text{ times}} \tag{7.74}$$

$$= g^{j_2} \circ g^{j_1}. \tag{7.75}$$

Notice that if $n$ is not a prime number, then it is possible to construct cyclic
subgroups with fewer than $n$ elements. For example, if $n$ is even, then it is possible
to create a cyclic subgroup with $n/2$ elements, by the proper choice of the generating
element $g$. In the following, we assume this is not the case, and that the cyclic
subgroups that we use do in fact have $n$ elements.

Two examples of cyclic subgroups include the "$q$-axis" and "$p$-axis" in phase
space. If you think of the $z$ components as forming a lattice in phase space, then the
$q$ and $p$ axes of phase space form two separate subgroups. These subgroups
$S_{(1,0,0)}$ and $S_{(0,1,0)}$ are generated by the elements $(1,0,0)$ and $(0,1,0)$. For
example, the $q$-axis is

$$S_{(1,0,0)} = \{(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0), \ldots, (n - 1, 0, 0)\}. \tag{7.76}$$

Another example of a cyclic subgroup of this group is the center of the group. The
center is the set of elements which commute with all elements of the group, and in
this case is formed by the $\lambda$ axis, which is generated by the element $(0, 0, 1)$:

$$S_\lambda = Z(\mathfrak{H}_n) = \{(0, 0, \lambda) \mid \lambda \in \mathbb{Z}_n\}. \tag{7.77}$$

This subgroup is in fact a normal subgroup of \mathfrak{H}_n. This means that

\forall h \in \mathfrak{H}_n, \quad h S_\lambda h^{-1} = S_\lambda,   (7.78)

or in terms of a specific element of the center,

h \circ (0, 0, \lambda) \circ h^{-1} = (0, 0, \lambda).   (7.79)

FIG. 7.2: The elements of the subgroup are circled. These elements commute with
all other elements of the group, and thus form the center of the group.

The second type of commutative subgroup is the maximal commutative subgroup.
A maximal commutative subgroup is one to which you cannot add any more elements
of the group and have the subgroup remain commutative. One can prove that, for
the group we are considering, a maximal commutative subgroup is a direct product
of a cyclic subgroup and the center. These sets have the form

\bar{S}_g = S_g \otimes S_\lambda, \quad g \notin S_\lambda.   (7.80)

Any additional element that one might add to this subgroup will not commute with
all of the elements already in the subgroup. Interestingly, this subgroup is also a
normal subgroup, \forall h \in \mathfrak{H}_n, h \bar{S}_g h^{-1} = \bar{S}_g.
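
The statements about the center and the maximal commutative subgroups can be checked by brute force for n = 3. The sketch below again assumes the group law (q', p', λ') ∘ (q, p, λ) = (q' + q, p' + p, λ' + λ + q'p − p'q) mod n, and takes the p-axis as the cyclic factor; all function names are illustrative.

```python
from itertools import product

N = 3
G = list(product(range(N), repeat=3))  # elements (q, p, lam)


def mul(a, b):
    """Assumed group law, all arithmetic mod N."""
    q1, p1, l1 = a
    q2, p2, l2 = b
    return ((q1 + q2) % N, (p1 + p2) % N, (l1 + l2 + q1 * p2 - p1 * q2) % N)


def inv(a):
    """Inverse element: every component changes sign."""
    return ((-a[0]) % N, (-a[1]) % N, (-a[2]) % N)


def commute(a, b):
    return mul(a, b) == mul(b, a)


# Center: elements commuting with the whole group.
center = [g for g in G if all(commute(g, h) for h in G)]

# Candidate maximal commutative subgroup: p-axis combined with the center.
S_bar = [(0, p, lam) for p in range(N) for lam in range(N)]

is_commutative = all(commute(a, b) for a in S_bar for b in S_bar)
# Maximality: every element outside S_bar fails to commute with some member.
is_maximal = all(any(not commute(g, s) for s in S_bar) for g in G if g not in S_bar)
# Normality: conjugation by any group element maps S_bar into itself.
is_normal = all(mul(mul(h, s), inv(h)) in S_bar for h in G for s in S_bar)
```

For n = 3 the center found this way is the λ axis, and the nine-element set `S_bar` passes all three checks.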
Irreducible Representations

The finite Heisenberg-Weyl group \mathfrak{H}_n has n^2 inequivalent 1-dimensional
irreducible representations, and n - 1 inequivalent n-dimensional irreducible
representations.

We can check that these are all of the irreducible representations by using the
equation \sum_\rho n_\rho^2 = N_G (Section 2.5, Corollary 2(a) in [22]), i.e., the sum of the
squared dimensions of the irreducible representations equals the order of the group.
So for the group \mathfrak{H}_n we have

\sum_\rho n_\rho^2 = n^2 \cdot 1^2 + (n - 1) \cdot n^2 = n^3,   (7.81)

which is equal to the number of elements in this group, N_{\mathfrak{H}_n} = n^3. These are
therefore all of the possible unitary irreducible representations.

The 1-dimensional irreducible representations are labeled by a pair of integers
u, v \in \mathbb{Z}_n,

\varrho_{u,v}(q, p, \lambda) = \exp\left(\frac{2\pi i}{n}(uq + vp)\right).   (7.82)

The representation with (u, v) = (0, 0) gives the trivial representation, where every
element of the group maps to 1. There are n^2 of these representations, so for n = 3,
we have 3 × 3 = 9 of these one dimensional irreducible representations.

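The n-dimensional irreducible representations involve shifts and multiplication
by a phase. In matrix form, these can be written as

\rho_a(q, p, \lambda) = \exp\left(\frac{2\pi i a}{n}(\lambda + qp)\right) T^{-2ap} S^{-q}   (7.83)

where a \in \mathbb{Z}_n \setminus \{0\}, S is the shift matrix, and T is the diagonal matrix of n-th
roots of unity. For example, if n = 3, then we have

S = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix}
\quad \text{and} \quad
T = \begin{pmatrix} 1 & 0 & 0 \\ 0 & e^{2\pi i/3} & 0 \\ 0 & 0 & e^{4\pi i/3} \end{pmatrix}.   (7.84)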
These matrices act on the vector space \mathcal{H}_{\rho_a} = \mathbb{C}^3. There are, for n = 3, two choices
for a \neq 0, so there are two 3-dimensional irreducible representations. The case
a = 0 was excluded because with a = 0, this representation reduces to a direct sum
of one dimensional representations.

Direct computation shows that these are in fact representations of the group,
and that they are irreducible. Since these representations do not coincide on the
center, they must be inequivalent representations. (This is a consequence of the
Stone-von Neumann theorem.) Also, by the dimension count in Equation (7.81),
these, together with the one dimensional irreducible representations, must be all of
the irreducible representations of this group.
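
The "direct computation" mentioned above can be sketched numerically for n = 3. The block below assumes the group law (q', p', λ') ∘ (q, p, λ) = (q' + q, p' + p, λ' + λ + q'p − p'q) mod 3, and it assumes that the n-dimensional representation has the form exp(2πi a(λ + qp)/n) T^{-2ap} S^{-q} with (S c)_x = c_{x+1} and T = diag(w^x); both are reconstructions from the surrounding formulas rather than statements confirmed by the source. Irreducibility is tested with the standard criterion that the character has unit norm.

```python
import cmath
from itertools import product

N = 3
W = cmath.exp(2j * cmath.pi / N)
G = list(product(range(N), repeat=3))


def mul(a, b):
    """Assumed group law, all arithmetic mod N."""
    q1, p1, l1 = a
    q2, p2, l2 = b
    return ((q1 + q2) % N, (p1 + p2) % N, (l1 + l2 + q1 * p2 - p1 * q2) % N)


def rho(g, a=1):
    """Assumed form: phase * T^(-2 a p) * S^(-q), as an N x N matrix."""
    q, p, lam = g
    phase = W ** (a * (lam + q * p))
    M = [[0j] * N for _ in range(N)]
    for x in range(N):
        # Row x picks up component x - q and the diagonal phase of T^(-2 a p).
        M[x][(x - q) % N] = phase * W ** (-2 * a * p * x)
    return M


def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]


def close(A, B, tol=1e-9):
    return all(abs(A[i][j] - B[i][j]) < tol for i in range(N) for j in range(N))


def is_representation(a):
    """Check rho(g) rho(h) = rho(g o h) for every pair of group elements."""
    return all(close(matmul(rho(g, a), rho(h, a)), rho(mul(g, h), a))
               for g in G for h in G)


def character_norm(a):
    """(1/|G|) sum_g |chi(g)|^2; equals 1 for an irreducible representation."""
    return sum(abs(sum(rho(g, a)[x][x] for x in range(N))) ** 2 for g in G) / len(G)
```

Under these assumptions, both a = 1 and a = 2 pass the homomorphism check and have unit character norm.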

The Regular Representation for \mathfrak{H}_n

We can also write out the regular representation of this group. The regular
representation acts on the space of functions on the group.

(\rho_L(h) f)(g) = f(h^{-1} \circ g), \quad f : \mathfrak{H}_n \to \mathbb{C}   (7.85)

For the group we are considering, the function f can be thought of as a vector in an
n^3 dimensional space. We can take as basis vectors delta functions on each of the
elements of the group.

\delta_h(g) = \begin{cases} 1 & \text{if } h = g \\ 0 & \text{otherwise} \end{cases}   (7.86)

The regular representation can then be written as a set of n^3 \times n^3 matrices with this
basis. By looking at the multiplication table, we can figure out how some element g
maps the elements of the group back into the group, and construct a matrix \rho_L(g)
that permutes the basis vectors in the same way. For example, if g = (1, 0, 0), then
from the multiplication table in Equation (7.69) we get that (leaving out the zeros)
\rho_L(1, 0, 0) = (27 \times 27 permutation matrix: a single 1 in each row and each column, zeros omitted)   (7.87)

Other 27-dimensional matrices can be constructed for each other element of the 
group. 
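
A minimal sketch of how such a permutation matrix can be generated for n = 3 is given below. It assumes the group law (q', p', λ') ∘ (q, p, λ) = (q' + q, p' + p, λ' + λ + q'p − p'q) mod 3 and an arbitrary lexicographic ordering of the 27 basis vectors, so the resulting matrix need not match the layout of Equation (7.87) entry by entry.

```python
from itertools import product

N = 3
G = list(product(range(N), repeat=3))       # assumed ordering of the basis
index = {g: i for i, g in enumerate(G)}     # position of each delta function


def mul(a, b):
    """Assumed group law, all arithmetic mod N."""
    q1, p1, l1 = a
    q2, p2, l2 = b
    return ((q1 + q2) % N, (p1 + p2) % N, (l1 + l2 + q1 * p2 - p1 * q2) % N)


def regular_matrix(h):
    """Matrix of rho_L(h) in the delta basis: delta_k is sent to delta_{h o k}."""
    M = [[0] * len(G) for _ in G]
    for k in G:
        M[index[mul(h, k)]][index[k]] = 1
    return M


M = regular_matrix((1, 0, 0))
```

The column for the basis vector δ_k has its single 1 in the row of δ_{h∘k}, because (ρ_L(h)δ_k)(g) = δ_k(h^{-1} ∘ g) is nonzero only at g = h ∘ k.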



Decomposition of the left regular representation 

Recall that the left regular representation is a shift operation on functions on
the group. It is defined as

(\rho_L(h) f)(g) = f(h^{-1} \circ g),   (7.88)

where f(g) is a function on the group. The set of such functions can be thought
of as an n^3 dimensional vector space \mathcal{F} = \{f : \mathfrak{H}_n \to \mathbb{C}\}. As described in Section
7.5, we can reduce this representation into a direct sum of the irreducible
representations, with multiplicities equal to the dimension of the particular irreducible
representation. This reduction is performed by constructing a projection operator
which projects functions onto the ρ-invariant subspace associated with the particular
irreducible representation. This procedure divides the Hilbert space into the
primary components. These primary components can be further reduced to the
irreducible components by an arbitrary choice of "basis" in the ρ-invariant subspace.
As we'll see, the primary representations act on functions on phase space, while the
irreducible representations act on functions on a "coordinate" subspace.

Canonical Decomposition: the reduction to primaries 

This procedure can be carried out explicitly in this case, using the formulas
for the irreducible representations in Equations (7.82) and (7.83), and the formula
for the projector [22]. Let's use the characters of the irreducible representation
defined above, with a = 1, and also specialize to the case n = 3. The projector is

heG 

Using the explicit formula above for the irreducible representation \rho_1, we can
calculate its character

\chi_{\rho_1}(g) = 3 \left( \delta_{(0,0,0)} + e^{\frac{2\pi i}{3}} \delta_{(0,0,1)} + e^{\frac{4\pi i}{3}} \delta_{(0,0,2)} \right)(g),   (7.90)

where the delta functions are functions on the group defined in Equation (7.86). If
we think of this character as a function on the group, we see that it is only nonzero
on the center S_\lambda, where its functional form is given by an irreducible representation
of the subgroup, e^{\frac{2\pi i}{3}\lambda}.

Using the formula for the character of \rho_1, we see that the projector acting on a
given function on the group is

(P_1 f)(g) = \sum_{h \in G} \left( \delta_{(0,0,0)} + e^{-\frac{2\pi i}{3}} \delta_{(0,0,1)} + e^{-\frac{4\pi i}{3}} \delta_{(0,0,2)} \right)(h)\, f(h^{-1} \circ g)   (7.92)
= f(g) + e^{-\frac{2\pi i}{3}} f((0,0,1)^{-1} \circ g) + e^{-\frac{4\pi i}{3}} f((0,0,2)^{-1} \circ g)   (7.93)
= f(g) + e^{-\frac{2\pi i}{3}} f((0,0,2) \circ g) + e^{-\frac{4\pi i}{3}} f((0,0,1) \circ g)   (7.94)

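This projector should project any function onto a 9-dimensional subspace of \mathcal{F} which
is invariant under the action of the regular representation. Since the delta functions
form a basis in \mathcal{F}, we can use them to find a basis of the 9-dimensional subspace.

(P_1 \delta_h)(g) = \delta_h(g) + e^{-\frac{2\pi i}{3}} \delta_h((0,0,2) \circ g) + e^{-\frac{4\pi i}{3}} \delta_h((0,0,1) \circ g)   (7.95)
= \delta_h(g) + e^{-\frac{2\pi i}{3}} \delta_{(0,0,-2) \circ h}(g) + e^{-\frac{4\pi i}{3}} \delta_{(0,0,-1) \circ h}(g)   (7.96)
= \sum_{\lambda \in \mathbb{Z}_3} e^{\frac{2\pi i}{3}\lambda}\, \delta_{(h_q, h_p, h_\lambda - \lambda)}(g)   (7.97)
= \sum_{\lambda' \in \mathbb{Z}_3} e^{\frac{2\pi i}{3}(h_\lambda - \lambda')}\, \delta_{(h_q, h_p, \lambda')}(g)   (7.98)
= e^{\frac{2\pi i}{3} h_\lambda} \sum_{\lambda' \in \mathbb{Z}_3} e^{-\frac{2\pi i}{3}\lambda'}\, \delta_{(h_q, h_p, \lambda')}(g)   (7.99)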

From this expression it is clear that the delta functions for all possible values of h
project onto only nine linearly independent functions on the group, one associated
with each point (h_q, h_p) in the phase space. Each of the delta functions with the
same phase space component maps to the same function, up to a complex coefficient
given by the λ component (see Figure 7.3).

We can use this projection of the delta functions to define a basis for the
9-dimensional invariant subspace.

f_{(q,p)}(g) = (P_1 \delta_{(q,p,0)})(g) = \sum_{\lambda \in \mathbb{Z}_3} e^{-\frac{2\pi i}{3}\lambda}\, \delta_{(q,p,\lambda)}(g)   (7.100)

These functions are an example of the covariant functions described in Section 7.4.
They satisfy the covariance relationship in Equation (7.36) for the commutative
subgroup S_\lambda. This can be checked by calculating how the functions vary under a
right shift by an element of the center S_\lambda.

f_{(q,p)}(g \circ (0,0,\lambda)) = \sum_{\lambda' \in \mathbb{Z}_3} e^{-\frac{2\pi i}{3}\lambda'}\, \delta_{(q,p,\lambda')}(g \circ (0,0,\lambda))   (7.101)

We can use the delta function to move the element (0, 0, λ) from the argument into
the phase.

g \circ (0,0,\lambda) = (q, p, \lambda') \;\Rightarrow\; g = (q, p, \lambda' - \lambda) = (q, p, \lambda'')   (7.102)

Use the new variable \lambda'' = \lambda' - \lambda to simplify the expression above.

f_{(q,p)}(g \circ (0,0,\lambda)) = \sum_{\lambda'' \in \mathbb{Z}_3} e^{-\frac{2\pi i}{3}(\lambda'' + \lambda)}\, \delta_{(q,p,\lambda'')}(g)   (7.103)
= e^{-\frac{2\pi i}{3}\lambda} \sum_{\lambda'' \in \mathbb{Z}_3} e^{-\frac{2\pi i}{3}\lambda''}\, \delta_{(q,p,\lambda'')}(g)   (7.104)
= e^{-\frac{2\pi i}{3}\lambda} f_{(q,p)}(g)   (7.105)
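
The projection and the covariance property can be illustrated with a small numerical sketch for n = 3. It assumes that central elements multiply as (0, 0, λ) ∘ (q, p, l) = (q, p, l + λ) mod 3, and it uses the unnormalized form of the projector, with coefficients conjugate to the character of ρ_1 on the center. Functions on the group are stored as dictionaries; all names are illustrative.

```python
import cmath
from itertools import product

N = 3
W = cmath.exp(2j * cmath.pi / N)
G = list(product(range(N), repeat=3))


def delta(h):
    """Delta function on the group concentrated at h."""
    return {g: (1.0 + 0j if g == h else 0j) for g in G}


def project(f):
    """(P_1 f)(g) = sum_lam W^(-lam) f((0,0,lam)^{-1} o g), without normalization."""
    out = {}
    for (q, p, l) in G:
        out[(q, p, l)] = sum(W ** (-lam) * f[(q, p, (l - lam) % N)]
                             for lam in range(N))
    return out


def f_basis(q, p):
    """Basis function f_(q,p) obtained by projecting delta_(q,p,0)."""
    return project(delta((q, p, 0)))


def is_covariant(f):
    """Check f(g o (0,0,lam)) = exp(-2 pi i lam / 3) f(g) for all g and lam."""
    return all(abs(f[(q, p, (l + lam) % N)] - W ** (-lam) * f[(q, p, l)]) < 1e-9
               for (q, p, l) in G for lam in range(N))
```

In this sketch every projected delta function is a phase multiple of one of the nine functions `f_basis(q, p)`, and each of those passes `is_covariant`.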

So the functions f_{(q,p)}(g) are elements of the subspace \mathcal{S}(S_\lambda, \tau),

\mathcal{F} \supset \mathcal{S}(S_\lambda, \tau) = \{ f : \forall h \in S_\lambda, \forall g \in G, \; f(g \circ h) = \tau(h) f(g) \},   (7.106)

where \tau is the irreducible representation of the center given by

\tau((0, 0, \lambda)) = e^{-\frac{2\pi i}{3}\lambda}.   (7.107)

As shown in Equation (7.37), any function in the space \mathcal{S}(S_\lambda, \tau) can be described
by the values that it takes on the elements of the factor space \mathfrak{H}_n / S_\lambda. This can be
written fairly simply by using the functions f_{(q,p)}(g) as a basis. For any \phi \in \mathcal{S}(S_\lambda, \tau),
we can expand it on the basis.

\phi(g) = \sum_{(q,p)} c_{(q,p)} f_{(q,p)}(g)   (7.108)

FIG. 7.3: The basis functions of the primary subspace are delta functions along fibers
over points in phase space. Functions in this subspace can have arbitrary values for
different points in phase space, but their values in the λ direction are determined by the
particular irreducible representation associated with this invariant subspace.

The points (q, p) label the equivalence classes in \mathfrak{H}_n / S_\lambda, and we can think of the
expansion coefficients c_{(q,p)} as a function on the space of equivalence classes.

The invariance of the subspace \mathcal{S}(S_\lambda, \tau) under the action of the left regular
representation can be shown by computing the action of \rho_L(h) on an arbitrary basis
function, for any h in the whole group.

(\rho_L(h) f_{(q,p)})(g) = f_{(q,p)}(h^{-1} \circ g)   (7.109)
= \sum_{\lambda \in \mathbb{Z}_3} e^{-\frac{2\pi i}{3}\lambda}\, \delta_{(q,p,\lambda)}(h^{-1} \circ g)   (7.110)
= \sum_{\lambda \in \mathbb{Z}_3} e^{-\frac{2\pi i}{3}\lambda}\, \delta_{(h_q + q,\; h_p + p,\; \lambda + h_\lambda + (h_q p - h_p q))}(g)   (7.111)
= \exp\left( \frac{2\pi i}{3}(h_\lambda + h_q p - h_p q) \right) f_{(q + h_q,\, p + h_p)}(g)   (7.112)

So, the regular representation permutes the functions f_{(q,p)} among themselves, and
multiplies them by a phase. Therefore, \mathcal{S}(S_\lambda, \tau) is invariant under the action of the
regular representation. Even though we used an explicit choice of basis functions
(the f_{(q,p)}(g) functions), this is actually not necessary in order to reduce the regular
representation. All the information about this invariant subspace is contained in
the projection operator P_1 which we used to construct this basis. The restriction of
the left regular representation to the subspace \mathcal{S}(S_\lambda, \tau) is a primary representation
associated with the irreducible representation \rho_1. This primary representation acts
on functions on phase space, i.e., it acts on functions which are defined by their
values c_{(q,p)} at each point (q, p).
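
The action of the left regular representation on the basis functions, Equation (7.112), can be checked by brute force for n = 3. The sketch assumes the group law (q', p', λ') ∘ (q, p, λ) = (q' + q, p' + p, λ' + λ + q'p − p'q) mod 3, which is the law implied by the index of the delta function in Equation (7.111); the helper names are illustrative.

```python
import cmath
from itertools import product

N = 3
W = cmath.exp(2j * cmath.pi / N)
G = list(product(range(N), repeat=3))


def mul(a, b):
    """Assumed group law, all arithmetic mod N."""
    q1, p1, l1 = a
    q2, p2, l2 = b
    return ((q1 + q2) % N, (p1 + p2) % N, (l1 + l2 + q1 * p2 - p1 * q2) % N)


def inv(a):
    return ((-a[0]) % N, (-a[1]) % N, (-a[2]) % N)


def f_basis(q, p):
    """f_(q,p) = sum_lam exp(-2 pi i lam / 3) delta_(q,p,lam), as a dict on G."""
    return {g: (W ** (-g[2]) if g[:2] == (q, p) else 0j) for g in G}


def left_shift(h, f):
    """(rho_L(h) f)(g) = f(h^{-1} o g)."""
    h_inv = inv(h)
    return {g: f[mul(h_inv, g)] for g in G}


def check_7_112(h, q, p):
    """Compare rho_L(h) f_(q,p) with the phase times the shifted basis function."""
    hq, hp, hl = h
    lhs = left_shift(h, f_basis(q, p))
    phase = W ** (hl + hq * p - hp * q)
    rhs = f_basis((q + hq) % N, (p + hp) % N)
    return all(abs(lhs[g] - phase * rhs[g]) < 1e-9 for g in G)
```

Calling `check_7_112(h, q, p)` for every group element h and every phase space point (q, p) confirms that the nine basis functions are permuted among themselves up to a phase.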

The 9-dimensional invariant subspace \mathcal{S}(S_\lambda, \tau) can be further reduced into its
three invariant 3-dimensional components. However, this reduction is arbitrary,
since there is a choice of basis for the 3-dimensional subspaces which must be made.
This choice is related to the choice of a larger commutative subgroup (see Section
7.4) which is necessary for the following reduction.
Reduction of the regular representation to irreducibles

As observed in Section 7.3, if we want to reduce the regular representation
the most, we can do this by finding the largest commutative subgroup.
For the group \mathfrak{H}_n that we are considering, the maximal commutative subgroups are
formed as a direct sum of the center and another cyclic subgroup. There is some
flexibility in choosing the cyclic subgroup, which is reflected in the fact that reduction
of the regular representation to irreducibles is not unique. In this section, we will
choose a cyclic subgroup which lies in the phase space component of the group. Pick
an element of the group which is in the phase space component, g_z = (z, 0). The
point z \neq (0, 0) can now be used to generate a cyclic subgroup.

S_z = \{ g_z,\; g_z \circ g_z,\; \ldots,\; g_z^n \}   (7.113)

This choice of an element in phase space splits the phase space into separate
components. If for example we think of the element z as a momentum, then the subgroup
S_z would be the momentum axis, or "p"-axis. All other elements of phase space
(z', 0) will have some "position" component, so they will not commute with the
"momenta" in S_z.

We can now use the subgroup S_z to construct a maximally commutative subgroup.
Form the direct sum subgroup \bar{S}_z = S_z \oplus S_\lambda, z \neq 0. This subgroup is a
maximal commutative subgroup, with n^2 elements. It can be thought of as the set
of elements in the group which form the plane through the group which lies over
the "z"-axis. For example, if z = (0, 1), this set is the λp-plane, as shown in Figure
7.4.

This commutative subgroup has 1-dimensional irreducible representations

P_{(\alpha,\beta)}(jz, \lambda) = \exp\left( \frac{2\pi i}{n}(\alpha j + \beta \lambda) \right),   (7.114)

where (\alpha, \beta) \in \mathbb{Z}_n \oplus \mathbb{Z}_n. We would like to use one of these representations as a
starting point for constructing an irreducible representation of the group.

FIG. 7.4: The elements of the subgroup \bar{S}_p = S_p \oplus S_\lambda lie in the λp-plane, which is shaded
dark grey. The two planes parallel to it are the equivalence classes in the factor group
\mathfrak{H}_n / \bar{S}_z. These classes are labeled by the value of q, which is distinct in each class. Note
that in contrast to the qp-plane, the λp-plane is not a "phase space" since all elements
in the λp-plane mutually commute.

In the reduction of the regular representation to primaries, we got functions
that only depended on the phase space (q, p) component of h. That means that
they only depended on the class of h in \mathfrak{H}_n / S_\lambda. Similarly, we will now consider
functions which only depend on the class of h in \mathfrak{H}_n / \bar{S}_z. Such functions can be
found by considering the space of functions which are covariant with respect to right
shifts by elements of the subgroup \bar{S}_z.

\mathcal{S}(\bar{S}_z, P_{(\alpha,\beta)}) = \{ f : \forall h \in \bar{S}_z, \forall g \in G, \; f(g \circ h) = P_{(\alpha,\beta)}(h) f(g) \}   (7.115)

This space is invariant under the action of \rho_L, and it is a subspace of the space
\mathcal{S}(S_\lambda, \tau), if we choose the representation P_{(\alpha,\beta)} of \bar{S}_z which corresponds to the
representation \tau of S_\lambda.

We would now like to define a set of basis functions for \mathcal{S}(\bar{S}_z, P_{(\alpha,\beta)}). As in
the reduction to the primary representations, these basis functions can be labeled
by the right equivalence classes in \mathfrak{H}_n / \bar{S}_z. First pick a set of representatives k for
the classes. This set is a choice of n group elements, one in each of the different
equivalence classes. In particular, we will choose k's that form a cyclic subgroup.
This can easily be done by taking k \in S_{z_\perp}, where \omega(z, z_\perp) \neq 0. This means that
we are choosing the elements k to be in the phase space, i.e., they have λ = 0.
While this choice is not necessary, it makes the following construction simpler.

Now define the basis functions analogous to the functions f_{(q,p)} in the previous
section.

\phi_k(g) = \sum_{h \in \bar{S}_z} P_{(\alpha,\beta)}(h)\, \delta_{k \circ h}(g)   (7.116)

Since there are n different equivalence classes, there are only n of these functions, so
the regular representation restricted to this subspace is actually an n dimensional
irreducible representation, as long as we can show that this is an invariant subspace.
Actually, all we need to do is show that these functions satisfy the covariance
relationship in Equation (7.115). If they do, then by construction, they belong to a
subspace which is invariant under the action of the left regular representation.

Let's check the covariance of these functions by directly computing how they behave
under a right shift by an element h' of the subgroup \bar{S}_z.

\phi_k(g \circ h') = \sum_{h \in \bar{S}_z} P_{(\alpha,\beta)}(h)\, \delta_{k \circ h}(g \circ h')   (7.117)

Now use the delta function to rearrange the group elements.

k \circ h = g \circ h' \;\Rightarrow\; k \circ h'' = g   (7.118)

Here we made the change of variables h'' = h \circ (h')^{-1} = h - h'. Since these are
elements of a commutative subgroup, we are allowed to turn the multiplication by
the inverse into ordinary subtraction. Insert these results back into the summation.

\phi_k(g \circ h') = \sum_{h'' \in \bar{S}_z} P_{(\alpha,\beta)}(h'' + h')\, \delta_{k \circ h''}(g)   (7.119)
= P_{(\alpha,\beta)}(h') \sum_{h'' \in \bar{S}_z} P_{(\alpha,\beta)}(h'')\, \delta_{k \circ h''}(g)   (7.120)
= P_{(\alpha,\beta)}(h')\, \phi_k(g)   (7.121)

This shows that the functions \phi_k(g) do actually belong to the set \mathcal{S}(\bar{S}_z, P_{(\alpha,\beta)}). Also,
since there are n of these functions, and they are linearly independent (a property
inherited from the delta functions) they form a basis of the n-dimensional space
\mathcal{S}(\bar{S}_z, P_{(\alpha,\beta)}).

Now that we have identified the invariant subspace \mathcal{S}(\bar{S}_z, P_{(\alpha,\beta)}), we can
construct an irreducible representation \rho = \rho_L|_{\mathcal{S}(\bar{S}_z, P_{(\alpha,\beta)})} by restricting the left
regular representation to this subspace. The exception to this is the case where
(\alpha, \beta) = (\alpha, 0). In this case, the reduced representation \rho is a reducible
permutation of the elements of \mathcal{S}(\bar{S}_z, P_{(\alpha,\beta)}). This could be shown by direct calculation,
but is easier to see using the Stone-von Neumann theorem. When \beta = 0, the
representation \rho maps the elements of the center to the identity operator. Then,
by the Stone-von Neumann theorem, this representation is unitarily equivalent to the
trivial representation, where all elements map to the n-dimensional identity operator.
Therefore, since the n-dimensional identity operator is reducible, \rho is also reducible.

The reduction of the regular representation to n-dimensional irreducible repre- 
sentations performed above contains an arbitrary choice of basis. This can be seen 
in the choice of the subgroup we used to construct this basis. There is no reason 
to choose one element z over any other, and a different choice of z component can 
give us a different reduction. However, according to the Stone-von Neumann the- 
orem, this different reduction will give the same irreducible representations, up to 
a unitary transformation, since it maps the center of the group to the same set of 
operators. 

Example of the reduction to irreducible representations 

As a concrete example of the reduction to irreducible representations, take the
discrete Heisenberg-Weyl group with n = 3. Consider the subgroup \bar{S}_p, shown in
Figure 7.4. This subgroup contains the 9 points in the pλ-plane, (0, p, λ). Take
the representation with (\alpha, \beta) = (0, 2), which, because n = 3, is equivalent to

(\alpha, \beta) = (0, -1).   (7.122)

The basis functions are written as a sum over all values of p and λ, and can be
labeled by the q value of points in the equivalence class.

\phi_q(g) = \sum_p \sum_\lambda P_{(0,-1)}(0, p, \lambda)\, \delta_{(q,0,0) \circ (0,p,\lambda)}(g)   (7.123)
= \sum_p \sum_\lambda \exp\left( -\frac{2\pi i}{3}\lambda \right) \delta_{(q,0,0) \circ (0,p,\lambda)}(g)   (7.124)

Here we have used the three elements on the q axis to represent the classes,

S_q = \{(0, 0, 0), (1, 0, 0), (2, 0, 0)\}.   (7.125)

Using these as representatives of the classes \mathfrak{H}_3 / \bar{S}_p means that an arbitrary element
of the group can be written as

h = (h_q, h_p, h_\lambda) = (h_q, 0, 0) \circ (0, h_p, h_\lambda - h_q h_p).   (7.126)

We now want to see how the regular representation decomposes on the basis we have 
chosen. The decomposition of group elements given above is used several times in 
the following in order to move the element h from the argument of the delta function 
into the parameter of the delta function. 

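(\rho_L(h)\phi_q)(g) = \phi_q(h^{-1} \circ g)   (7.127)
= \sum_p \sum_\lambda \exp\left( -\frac{2\pi i}{3}\lambda \right) \delta_{(q,0,0) \circ (0,p,\lambda)}(h^{-1} \circ g)   (7.128)
= \sum_p \sum_\lambda \exp\left( -\frac{2\pi i}{3}\lambda \right) \delta_{(q + h_q, 0, 0) \circ (0,\; p + h_p,\; \lambda + h_\lambda - h_p(h_q + 2q))}(g)   (7.129)

Now change the variables in the summation to simplify this expression.

(\rho_L(h)\phi_q)(g) = \sum_{p'} \sum_{\lambda'} \exp\left( -\frac{2\pi i}{3}\left( \lambda' - h_\lambda + h_p(h_q + 2q) \right) \right) \delta_{(q + h_q, 0, 0) \circ (0, p', \lambda')}(g)   (7.130)
= \exp\left( \frac{2\pi i}{3}\left( h_\lambda - h_p(h_q + 2q) \right) \right) \phi_{q + h_q}(g)   (7.131)

This representation cannot be reduced any further. It is a discrete version of the
standard Schrödinger representation of the
continuous Heisenberg-Weyl group.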
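
The decomposition of group elements and the resulting action on the functions φ_q can be verified numerically for n = 3. The sketch assumes the group law (q', p', λ') ∘ (q, p, λ) = (q' + q, p' + p, λ' + λ + q'p − p'q) mod 3, which reproduces Equation (7.126); the helper names are illustrative.

```python
import cmath
from itertools import product

N = 3
W = cmath.exp(2j * cmath.pi / N)
G = list(product(range(N), repeat=3))


def mul(a, b):
    """Assumed group law, all arithmetic mod N."""
    q1, p1, l1 = a
    q2, p2, l2 = b
    return ((q1 + q2) % N, (p1 + p2) % N, (l1 + l2 + q1 * p2 - p1 * q2) % N)


def inv(a):
    return ((-a[0]) % N, (-a[1]) % N, (-a[2]) % N)


def phi(q):
    """phi_q = sum over (p, lam) of exp(-2 pi i lam / 3) delta_{(q,0,0) o (0,p,lam)}."""
    f = {g: 0j for g in G}
    for p in range(N):
        for lam in range(N):
            f[mul((q, 0, 0), (0, p, lam))] += W ** (-lam)
    return f


def left_shift(h, f):
    """(rho_L(h) f)(g) = f(h^{-1} o g)."""
    h_inv = inv(h)
    return {g: f[mul(h_inv, g)] for g in G}


def decomposes(h):
    """Check h = (h_q, 0, 0) o (0, h_p, h_lam - h_q h_p)."""
    hq, hp, hl = h
    return mul((hq, 0, 0), (0, hp, (hl - hq * hp) % N)) == h


def check_7_131(h, q):
    """Compare rho_L(h) phi_q with the phase times phi_{q + h_q}."""
    hq, hp, hl = h
    lhs = left_shift(h, phi(q))
    phase = W ** (hl - hp * (hq + 2 * q))
    rhs = phi((q + hq) % N)
    return all(abs(lhs[g] - phase * rhs[g]) < 1e-9 for g in G)
```

Because only three functions φ_0, φ_1, φ_2 appear, the restricted representation is 3-dimensional, in line with the counting argument above.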

We can write this representation as a matrix representation, where we use the
functions \phi_q as the basis vectors.

\boldsymbol{\phi}_q = \left( \phi_0(g),\; \phi_1(g),\; \phi_2(g) \right)^T   (7.132)

Using this basis, the matrices S and T from Equation (7.84) can be used to write
the reduced representation as

(R_q(h))\, \boldsymbol{\phi}_q = e^{\frac{2\pi i}{3}(h_\lambda + h_p h_q)}\, T^{-2h_p} S^{-h_q}\, \boldsymbol{\phi}_q.   (7.133)

So here it is, a concrete example of how the regular representation can be reduced
to an irreducible representation.



Metaplectic Transformations and the Discrete HW Group 

The discrete Heisenberg-Weyl group that we are considering is an ideal context 
in which to illustrate the Stone-von Neumann theorem. From the perspective of a 
physicist, perhaps the most interesting thing about the Stone-von Neumann theorem 
is the importance it gives to phase space instead of configuration space. Mathemat- 
ically, phase space is related to the primary representations of the Heisenberg-Weyl 
group, and this is good since the reduction of the regular representation to primaries 
is well defined. On the other hand configuration space is related to the reduction of 
primary representations to irreducible representations. This reduction involves an 
arbitrary choice of a particular commutative subgroup in phase space to use as the 
configuration space. Since this choice of configuration space is not unique, it is not 
the most natural space to consider. A unitary transformation of the representation 
will take you from one choice of configuration space to a different choice, so any 
choice you make is not really any better than any other. 

In the case of the discrete Heisenberg-Weyl group these ideas are readily
illustrated. The primary representation acts on functions on phase space. Writing
the phase space component as z = (q, p), the primary representation in Equation
(7.112) becomes

(\rho(z', \lambda) f_{(z)})(g) = \exp\left( \frac{2\pi i}{3}\left( \lambda + \omega(z, z') \right) \right) f_{(z + z')}(g).   (7.134)

Further reduction of the regular representation required choosing a maximal
commutative subgroup \bar{S}_z of \mathfrak{H}_3 and a set of labels for the equivalence classes in \mathfrak{H}_3 / \bar{S}_z.
In Equation (7.123) we chose \bar{S}_p as the maximal commutative subgroup, and we
chose the q-axis as the labels for the classes in \mathfrak{H}_3 / \bar{S}_p. This gave us a set of functions
labeled by q on which the regular representation could be reduced. We obtained the
representation in Equation (7.131).

(\rho_L(h)\phi_q)(g) = \exp\left( \frac{2\pi i}{3}\left( h_\lambda - h_p(h_q + 2q) \right) \right) \phi_{q + h_q}(g)   (7.135)

If instead we use the subgroup \bar{S}_q = S_\lambda \oplus S_{(1,0,0)} to decompose the group, and label
our functions by elements along the p-axis, then we would get the representation

(\rho_L(h)\phi_p)(g) = \exp\left( \frac{2\pi i}{3}\left( h_\lambda + h_q(h_p + 2p) \right) \right) \phi_{p + h_p}(g).   (7.136)

Using the \phi_p basis, the matrix form of this representation is

(R_p(h))\, \boldsymbol{\phi}_p = \exp\left( \frac{2\pi i}{3}(h_\lambda - h_q h_p) \right) T^{2h_q} S^{-h_p}\, \boldsymbol{\phi}_p.   (7.137)

The representations R_q and R_p both map elements (0, 0, λ) of the center to
multiplication by the phase \exp\left(\frac{2\pi i}{3}\lambda\right). The Stone-von Neumann theorem then tells
us that there is a unitary transformation which converts one of these representations
into the other. In this case the transformation which does this is related to the
discrete Fourier transformation. We can write the discrete Fourier matrix as

[W]_{jk} = \frac{1}{\sqrt{3}} \exp\left( \frac{2\pi i}{3}(jk) \right).   (7.138)

The Fourier transform takes the q-axis to the p-axis by rotating phase space by 90°.
Another application of the Fourier transform would give another rotation, which is
the same as flipping the sign on both axes. Since we don't want to change the sign
of the axes, we might expect that we need to use the inverse Fourier transform to
relate the two representations R_q and R_p. Direct calculation confirms that this is
correct.

Using the definition of W, it can be shown that the 3 × 3 matrices W^{-1}, S,
and T satisfy certain commutation relations.

S^{-1} \cdot W^{-1} = W^{-1} \cdot T   (7.139)
T^{-1} \cdot W^{-1} = W^{-1} \cdot S^{-1}   (7.140)
S \cdot T = e^{\frac{2\pi i}{3}}\, T \cdot S   (7.141)

Using these relations, we can show that the matrix representations R_q and R_p are
related as follows

R_p \cdot W^{-1} = W^{-1} \cdot R_q.   (7.142)

So the inverse Fourier transform given by the matrix W^{-1} relates the two
representations. It is the unitary transformation which we knew must exist as a consequence
of the Stone-von Neumann theorem. If the representations R_q and R_p were
constructed using different maximal commutative subgroups, they would still be related
by a unitary transformation. Instead of the Fourier transform matrix, however, the
unitary transformation would be a general metaplectic transformation.
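
The intertwining relation between the two matrix representations can be checked numerically for n = 3. The sketch below assumes the conventions (S c)_x = c_{x+1}, T = diag(1, w, w^2) with w = exp(2πi/3), R_q(h) = w^{h_λ + h_p h_q} T^{-2h_p} S^{-h_q} and R_p(h) = w^{h_λ - h_q h_p} T^{2h_q} S^{-h_p}; these are readings of the formulas above, and the helper names are illustrative.

```python
import cmath
from itertools import product

N = 3
w = cmath.exp(2j * cmath.pi / N)
G = list(product(range(N), repeat=3))


def S_pow(k):
    """S^k for the shift (S c)_x = c_{x+1}; k may be negative."""
    return [[1 if y == (x + k) % N else 0 for y in range(N)] for x in range(N)]


def T_pow(k):
    """T^k for T = diag(1, w, w^2)."""
    return [[w ** (k * x) if x == y else 0 for y in range(N)] for x in range(N)]


def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]


def scale(c, A):
    return [[c * a for a in row] for row in A]


def close(A, B, tol=1e-9):
    return all(abs(A[i][j] - B[i][j]) < tol for i in range(N) for j in range(N))


def R_q(h):
    hq, hp, hl = h
    return scale(w ** (hl + hp * hq), matmul(T_pow(-2 * hp), S_pow(-hq)))


def R_p(h):
    hq, hp, hl = h
    return scale(w ** (hl - hq * hp), matmul(T_pow(2 * hq), S_pow(-hp)))


# Inverse discrete Fourier matrix, [W^{-1}]_{jk} = w^(-jk) / sqrt(3).
W_inv = [[w ** (-j * k) / N ** 0.5 for k in range(N)] for j in range(N)]


def intertwines(h):
    """Check R_p(h) W^{-1} = W^{-1} R_q(h)."""
    return close(matmul(R_p(h), W_inv), matmul(W_inv, R_q(h)))
```

Under these conventions `intertwines(h)` holds for all 27 group elements, so W^{-1} plays the role of the unitary map guaranteed by the Stone-von Neumann theorem.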

7.7.2 The Continuous Heisenberg-Weyl Group 

The continuous Heisenberg-Weyl group, \mathfrak{H}, is a generalization of the discrete
group discussed in the previous section. As a set, \mathfrak{H} is composed of a phase space
component, and an additional one-dimensional degree of freedom which encodes the
commutation relations. So its elements are points in \mathbb{R}^{2n+1}.

\mathfrak{H} : \{ g = (z, \lambda) \mid z = (q, p) \in \mathbb{R}^n \times \mathbb{R}^n;\; \lambda \in \mathbb{R} \}   (7.143)

Here n is the dimension of the configuration space. For an N particle system in 3
dimensional space, n = 3N. The group product law is analogous to the discrete case,
except that the operations are now vector operations.

g' \circ g = \left( z' + z,\; \lambda' + \lambda + \tfrac{1}{2}\omega(z', z) \right)   (7.144)

and \omega is again the symplectic product

\omega(z, z') = q'^T \cdot p - q^T \cdot p' = z'^T \cdot J^{-1} \cdot z.   (7.145)

Commutative Subgroups 

As in the discrete case, the continuous Heisenberg-Weyl group has a variety
of commutative subgroups. The center of the group is again formed by the λ axis.
Each element (0, λ) commutes with all elements of the group, and the set of such
elements forms a commutative subgroup which is isomorphic to the group of real
numbers with addition, (\mathbb{R}, +).

There are also subgroups which can be generated by a given element of the
group. Take any element (z, λ). Then consider the set generated by scalar multiples
of this element.

S_g = \{ (cz, c\lambda) \mid c \in \mathbb{R} \}   (7.146)

This set is a line through the origin in \mathbb{R}^{2n+1}, and it is a commutative subgroup.

More interesting commutative subgroups can be formed by taking direct products
of subgroups of the form S_g, where g = (z, 0) has no λ component. This group
then is a line in the 2n-dimensional phase space. Consider a maximally commuting,
linearly independent, set of vectors z_i in phase space. Since phase space is
2n-dimensional, we can pick n different vectors such that \omega(z_i, z_j) = 0. These vectors
can be used as basis vectors for a Lagrange plane in phase space. A Lagrange
plane is a maximal commutative subgroup contained in phase space, and which can
be used as a "configuration space". Then form the direct sum of the commutative
subgroups they generate,

S_L = \bigoplus_{i=1}^{n} S_{z_i}.   (7.147)

Because (c_i z_i, 0) \circ (c_j z_j, 0) = (c_i z_i + c_j z_j, 0), this is also a commutative subgroup
of \mathfrak{H}. We can think of this subspace as a configuration space, and the vectors \{z_i\}
as the various directions in this configuration space. For example, any point in the
subgroup S_L could be written

q = \sum_i q_i z_i.   (7.148)

Any additional linearly independent vector z' that we might try to add to this set
will have a nonzero symplectic product with at least one of the z_i's. This means
that any additional vector contains a contribution from the conjugate momentum
of at least one of the z_i's.

Irreducible Representations 

The irreducible representations of the Heisenberg-Weyl group \mathfrak{H} come in two
flavors. First, as with the discrete group, there are one dimensional representations.

\varrho_{(u,v)}(q, p, \lambda) = \exp\left( i(u \cdot q + v \cdot p) \right)   (7.149)

The parameters u and v are n-dimensional vectors, so these representations form a
2n-dimensional (continuously infinite) family of one dimensional irreducible
representations.

There is also a family of infinite dimensional irreducible representations, which
are analogous to the n-dimensional irreducible representations of the finite
Heisenberg-Weyl group. These infinite dimensional irreducible representations give the standard
Schrödinger representation of the Heisenberg-Weyl group, which maps the group to
operators acting on functions on configuration space.

(\rho_\alpha(q', p', \lambda)\psi)(q) = e^{i\alpha\left( \lambda - \frac{1}{2} p' \cdot q' - p' \cdot q \right)} \psi(q + q')   (7.150)

Here, the function \psi is a complex function of n variables.

Reduction of the Regular Representation 

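The reduction of the regular representation to primaries gives shift operators
on phase space.

(\rho_\alpha(z', \lambda) f)(z) = e^{i\alpha\left( \lambda + \frac{1}{2}\omega(z, z') \right)} f(z + z')   (7.151)

Reducing this gives the irreducible representations \rho_\alpha in Equation (7.150), which
act on functions of q. An alternate basis for the decomposition would give operators
which act on functions on momentum space, p.

(\rho_\alpha(q', p', \lambda)\tilde{\psi})(p) = e^{i\alpha\left( \lambda + \frac{1}{2} p' \cdot q' + p \cdot q' \right)} \tilde{\psi}(p + p')   (7.152)

These representations are related to each other by a Fourier transform (and an
inversion and scaling of the p axis), which can be shown by explicit calculation.

\int dq\, e^{i p \cdot q} (\rho_\alpha(z', \lambda)\psi)(q)   (7.153)
= \int dq\, e^{i p \cdot q} e^{i\alpha\left( \lambda - \frac{1}{2} p' \cdot q' - p' \cdot q \right)} \psi(q + q')   (7.154)
= e^{i\alpha\left( \lambda - \frac{1}{2} p' \cdot q' \right)} \int dq\, e^{i p \cdot q} e^{-i\alpha p' \cdot q} \psi(q + q')   (7.155)
= e^{i\alpha\left( \lambda - \frac{1}{2} p' \cdot q' \right)} \int dq''\, e^{i(p - \alpha p') \cdot (q'' - q')} \psi(q'')   (7.156)
= e^{i\alpha\left( \lambda - \frac{1}{2} p' \cdot q' \right)} e^{-i(p - \alpha p') \cdot q'} \int dq''\, e^{i(p - \alpha p') \cdot q''} \psi(q'')   (7.157)
= e^{i\alpha\left( \lambda + \frac{1}{2} p' \cdot q' \right)} e^{-i p \cdot q'} \tilde{\psi}(p - \alpha p')   (7.158)
= e^{i\alpha\left( \lambda + \frac{1}{2} p' \cdot q' + p \cdot q' \right)} \tilde{\psi}(p + p')   (7.159)

In the last line, the substitutions p \to -\alpha p, and \tilde{\psi}(p) \to \tilde{\psi}(-p/\alpha) put this in the
form of Equation (7.152).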

This transformation is an example of the Stone-von Neumann theorem applied
to the continuous Heisenberg-Weyl group. Since the representations in Equations
(7.150) and (7.152) have the same action on the center of the group, then, by the
Stone-von Neumann theorem, these are equivalent irreducible representations. This
means that there must exist a unitary transformation which relates them, and the
previous calculation shows the transformation.

In general, there are infinitely many ways to express the irreducible
representation from Equation (7.150). Each realization requires a choice of the position
and momentum subspaces (the Lagrange manifold) in phase space. As is known
from classical mechanics, different choices of coordinates in phase space are related
to each other by linear canonical transformations. Explicitly, the linear canonical
transformation can be written in terms of a 2n × 2n transformation matrix with
unit determinant.

\begin{pmatrix} Q \\ P \end{pmatrix} = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \begin{pmatrix} q \\ p \end{pmatrix}   (7.160)



Alternatively, this transformation can be written in implicit form, using the
generating functions for canonical transformations. The generating function F_1(Q, q) for
this transformation is

F_1(Q, q) = \frac{1}{2b} \left( d|Q|^2 - 2 q^T \cdot Q + a|q|^2 \right).   (7.161)
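
As a quick plausibility check on this generating function, the sketch below treats a single degree of freedom with scalar coefficients satisfying ad − bc = 1. It assumes the sign convention p = −∂F_1/∂q and P = +∂F_1/∂Q, which is the convention implied by the operator calculation later in this section, and it evaluates the derivatives by central differences; the sample numbers are arbitrary.

```python
def F1(Q, q, a, b, d):
    """Generating function for one degree of freedom with scalar a, b, d."""
    return (d * Q * Q - 2.0 * q * Q + a * q * q) / (2.0 * b)


def canonical_pair(Q, q, a, b, d, eps=1e-6):
    """Return (p, P) from p = -dF1/dq and P = +dF1/dQ using central differences."""
    p = -(F1(Q, q + eps, a, b, d) - F1(Q, q - eps, a, b, d)) / (2 * eps)
    P = (F1(Q + eps, q, a, b, d) - F1(Q - eps, q, a, b, d)) / (2 * eps)
    return p, P


# Sample linear canonical transformation with unit determinant: a d - b c = 1.
a, b, c, d = 2.0, 1.0, 3.0, 2.0
q0, p0 = 0.7, -1.3
Q0 = a * q0 + b * p0      # new position from the linear map
P0 = c * q0 + d * p0      # new momentum from the linear map

p_from_F1, P_from_F1 = canonical_pair(Q0, q0, a, b, d)
```

With these sample values the momenta recovered from F_1 agree with p0 and with P0 = c q0 + d p0, which relies on the unit determinant condition.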

The unitary transformation which relates the irreducible representations is called
the metaplectic transformation, and, using this generating function, can be written

\tilde{\psi}(Q) = \mathcal{N} \int dq\, e^{i F_1(Q, q)} \psi(q).   (7.162)

We can check that this transformation gives the correct transformation of the
operators \hat{q} and \hat{p}. The new "position" operator can be written in terms of the old
position and momentum operators.

\hat{Q} = a\hat{q} + b\hat{p}   (7.163)

In the q representation, this is a combination of multiplication by q and derivative
with respect to q.

\hat{Q} = a \cdot q + b \cdot (-i\nabla_q)   (7.164)

Put this into the integral in Equation (7.162) in order to find \hat{Q} in the Q
representation.

\hat{Q}\tilde{\psi}(Q) = \mathcal{N} \int dq\, e^{i F_1(Q, q)} (a \cdot q - i b \cdot \nabla_q) \psi(q)   (7.165)

Use integration by parts to act with the derivatives on the phase.

\hat{Q}\tilde{\psi}(Q) = \mathcal{N} \int dq\, \psi(q) (a \cdot q + i b \cdot \nabla_q) e^{i F_1(Q, q)}   (7.166)
= \mathcal{N} \int dq\, \psi(q) \left( a \cdot q - b \cdot (\nabla_q F_1) \right) e^{i F_1(Q, q)}   (7.167)

Evaluating the derivative of the generating function gives the result we wanted.

\hat{Q}\tilde{\psi}(Q) = \mathcal{N} \int dq\, \psi(q) (a \cdot q + Q - a \cdot q) e^{i F_1(Q, q)}   (7.168)
= \mathcal{N} \int dq\, \psi(q) (Q) e^{i F_1(Q, q)}   (7.169)
= Q \tilde{\psi}(Q)   (7.170)

Similar calculations show that the new "momentum" operator is the expected
derivative.

\hat{P} = c\hat{q} + d\hat{p} \to -i\nabla_Q   (7.171)


CHAPTER 8 



Symbols of Operators and the 
Noncommutative Fourier Transform 

8.1 From Operators to Functions 

In Part I, the phase space perspective on wave problems was described and 
shown to provide a powerful framework for their analytical solution. 
At its core, this perspective relied on being able to convert a generic wave 
equation from an operator into a function on phase space. For wave problems, this 
gives the dispersion function, which encodes all information that is contained in the 
wave operator. With the dispersion function it is possible to define the dispersion 
surfaces. These surfaces give a graphical representation of the allowed frequencies 
and wavenumbers for the various wave modes that could be solutions to the wave 
equation. Also, for multicomponent problems, the dispersion surfaces can help to 
identify mode conversion regions through "avoided crossing" type structures, and 
other types of near-degeneracies. 

In this chapter, we review the mathematical tools needed to convert an arbitrary 
operator into a function on phase space. This function is called the symbol of the 
operator, and the mathematics of symbols is sometimes called the symbol calculus. 
Various types of symbols have been introduced in the literature, each using different 
orderings and complexification of the position and momentum operators [24]. 

    antistandard-ordered:   $e^{i\eta \hat{p}} e^{i\xi \hat{q}}$
    Weyl (symmetrized):     $e^{i\eta \hat{p} + i\xi \hat{q}} = e^{z \hat{a}^\dagger - z^* \hat{a}}$
    standard-ordered:       $e^{i\xi \hat{q}} e^{i\eta \hat{p}}$
    antinormal-ordered:     $e^{-z^* \hat{a}} e^{z \hat{a}^\dagger}$
                                                                    (8.1)


The Weyl symbol uses a symmetrized ordering. It thus treats $q$ and $p$ on an equal 
footing, and takes the central place in the above diagram describing how the different 
types of symbols relate to each other (see Berezin and Shubin for 
a similar diagram). While these various symbols have been extensively studied, and 
their relations to each other have been worked out, the resulting formulas are often 
presented in a mysterious way as the result of brute-force calculations. By recasting 
the theory of symbols in terms of Fourier transforms on groups, the proper context 
for a deeper understanding can be obtained. 

We start this chapter with a review of the symbol theory developed by Weyl 
in the context of quantum mechanics. We then proceed to a more general theory 
which defines symbols as a type of Fourier transform on noncommutative groups, a 
perspective introduced by Zobin. Finally, the Zobin perspective is used to construct 
the "symbol" of a matrix. Since a matrix is really an operator, it is possible to 
compute a function on a discrete phase space which is the symbol of this operator. 
This example illustrates the ideas of this chapter in a very concrete way. 



8.2 The Weyl Symbol 

In quantum mechanics, the state of a system is given as an abstract vector in 
some Hilbert space. Measurable quantities are then calculated as the expectation 
values of particular operators. The Schrödinger approach to quantum mechanics is 
to write the state as a wavefunction, with differential operators representing, e.g., 
the momentum of the system. 

$$\hat{p}|\psi\rangle = \int dq \, |q\rangle (-i\partial_q) \psi(q) \qquad (8.2)$$

Wigner, Weyl, and others developed an alternative approach to quantum mechanics. 
Wigner realized that the information about the state of the system could be written 
in what looks like a more classical way, by expressing the state as a function on 
phase space. The Wigner function of a state $\psi(q)$ is a function on phase space, and 
is defined as 

$$W(q, p) = \int ds \, e^{ips} \, \psi\!\left(q + \frac{s}{2}\right) \psi^*\!\left(q - \frac{s}{2}\right). \qquad (8.3)$$

It turns out that the Wigner function is simply the Weyl symbol of the density 
matrix of the state. Given the integral kernel $K_A(q, q')$ of some operator $\hat{A}$, the 
Weyl symbol of $\hat{A}$ is defined as [25] 

$$A(q, p) = \int ds \, e^{ips} \, K_A\!\left(q + \frac{s}{2}, q - \frac{s}{2}\right). \qquad (8.4)$$

For the density matrix, the integral kernel is 

$$K_\rho(q, q') = \langle q|\psi\rangle \langle\psi|q'\rangle = \psi^*(q')\psi(q). \qquad (8.5)$$

Inserting this into the equation for the symbol gives us the Wigner function. 
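
As a concrete illustration, the integral in Equation (8.3) can be evaluated numerically for a simple state. The sketch below assumes a normalized Gaussian wavefunction (chosen here only because its Wigner function is known in closed form) and uses the convention of Equation (8.3) with no additional normalization factor; the function names and quadrature parameters are hypothetical.

```python
import cmath
import math

def psi(q):
    """Assumed test state: the normalized Gaussian pi^(-1/4) exp(-q^2/2), which is real."""
    return math.pi ** -0.25 * math.exp(-0.5 * q * q)

def wigner(q, p, half_width=20.0, steps=4000):
    """Trapezoid-rule estimate of Equation (8.3); psi is real, so psi* = psi."""
    h = 2.0 * half_width / steps
    total = 0.0 + 0.0j
    for j in range(steps + 1):
        s = -half_width + j * h
        weight = 0.5 if j in (0, steps) else 1.0
        total += weight * cmath.exp(1j * p * s) * psi(q + 0.5 * s) * psi(q - 0.5 * s)
    return total * h

def wigner_exact(q, p):
    """Closed form for this Gaussian in the same convention: 2 exp(-q^2 - p^2)."""
    return 2.0 * math.exp(-q * q - p * p)

print(wigner(0.3, -0.4), wigner_exact(0.3, -0.4))
```

For this state the Wigner function is a smooth, positive Gaussian on phase space, which is the "more classical" picture referred to above.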

The Weyl symbol has many nice properties which motivate its use in the analysis 
of operators, especially when compared to the integral kernel of the operator. As an 
example, consider the derivative operator. Its kernel is a distribution, not a smooth 
function. 

$$\frac{d}{dx} f(x) = \int dx' \, \delta(x - x') \frac{d}{dx'} f(x') \qquad (8.6)$$
$$= -\int dx' \left( \frac{d}{dx'} \delta(x - x') \right) f(x') \qquad (8.7)$$
$$= \int dx' \, \delta^{(1)}(x - x') f(x') \qquad (8.8)$$

Now, if we calculate the symbol of this kernel, we will get a smooth function on 
phase space. This is not too surprising since the definition of the symbol looks like 
a Fourier transform, and we know that the Fourier transform of a delta function is 
a constant. 

$$D(q, p) = \int ds \, e^{ips} \, \delta^{(1)}(q + s/2 - q + s/2) \qquad (8.9)$$
$$= \int ds \, e^{ips} \frac{d}{ds} \delta(s) \qquad (8.10)$$
$$= -\int ds \, \delta(s) \frac{d}{ds} e^{ips} \qquad (8.11)$$
$$= -ip \int ds \, \delta(s) \, e^{ips} = -ip \qquad (8.12)$$



This result generalizes. The integral kernel of local, differential operators will gener- 
ically be some singular distribution. However, since the symbol is a kind of Fourier 
transform, the singularities get turned into smooth functions. 

This calculation of the symbol of the derivative also illustrates another feature 
of the Weyl symbol calculus. It enables us to relate quantum operators like $\hat{p} = i\partial_q$ 
with classical functions on phase space, like the classical momentum $p$. It is also 
possible to go the other way. We can "quantize" a function on phase space by 
inverting the formula for the symbol. This will give us an operator whose action is 
essentially the same as the original operator $\hat{A}$. 

$$\hat{A}' = \int \exp\left( i\omega(z, \hat{z}) \right) A(z) \, dz \qquad (8.13)$$

Here the phase space shift operator is given by 

$$T_z = e^{-i\omega(z, \hat{z})}, \qquad (8.14)$$

where $\hat{z} = (\hat{q}, \hat{p})$ is composed of the position operator, $\hat{q} f(q) = q f(q)$, and the 
momentum operator, $\hat{p} f(q) = -i\partial_q f(q)$. 

8.3 The Fourier Transform: A Group Theory Perspective 

8.3.1 Introduction 

This section is based on a series of seminars given by N. Zobin at William 
and Mary in the spring of 2004. The lectures described the group-theoretical basis 
of Fourier transforms, for both commutative and non-commutative groups. In the 
case of commutative groups, the transform that arises is the ordinary Fourier trans- 
form from harmonic analysis. However, when the construction described here is 
used for non-commutative groups, a different sort of Fourier transform is obtained. 
This non-commutative Fourier transform provides the group theoretical basis for 
understanding the symbol of an operator. This group theory point of view, which 
was developed by Zobin, provides the proper context for understanding the various 
types of symbols (Weyl, Wick, anti-Wick, normal, anti-normal, etc.) that have been 
defined in the literature. 

Motivational Example 

The group theory perspective of the Fourier transform is perhaps best explained 
by first starting with a simple example. The ideas presented in this example will 
then extend to a general definition of the Fourier transform. 

Consider the group of cyclic shifts on a set of four points. These shifts form a 
group, G. There are four different shifts which can be applied to the four points. 
They are shifts by zero, one, two, or three elements. As group elements, they can 
be written as 

$$G = \{e, g, g^2, g^3\}. \qquad (8.15)$$

Here $e$ is the identity element of the group, which corresponds to shifts by zero points. 
The one-point shift corresponds to the group element $g$. Repeated applications of 
shifts by one point will give all the other shifts, so they are written as powers of $g$. 

This is a commutative group, so its irreducible representations are simply 
phases. There are four different irreducible representations, which are labeled by 
the integers $k \in \{0, 1, 2, 3\}$. 

$$\rho_k(g^j) = e^{2\pi i jk/4} \qquad (8.16)$$

These phases can be written out in the character table, which relates the group 
elements and the characters of the representations. 

              e      g      g^2    g^3
    ρ_0       1      1      1      1
    ρ_1       1      i     -1     -i
    ρ_2       1     -1      1     -1
    ρ_3       1     -i     -1      i
                                            (8.17)

This table has the form of a matrix, where each row corresponds to a representation, 
and each column corresponds to an element of the group. As a matrix, this could 
multiply a vector with four elements. By looking at the labels in the table above, 
it is clear that the elements of the vector naturally correspond to the element of 
the group. So the vector could be thought of as a function on the group, when it 
is considered as a set. Each element in the vector resulting from the multiplication 
corresponds to a row from the matrix, and so can be labeled by the representation. 
This new vector is then a function on the set of irreducible representations of the 
group. 

$$\begin{pmatrix} \hat{f}(\rho_0) \\ \hat{f}(\rho_1) \\ \hat{f}(\rho_2) \\ \hat{f}(\rho_3) \end{pmatrix} = \begin{pmatrix} 1 & 1 & 1 & 1 \\ 1 & i & -1 & -i \\ 1 & -1 & 1 & -1 \\ 1 & -i & -1 & i \end{pmatrix} \begin{pmatrix} f(e) \\ f(g) \\ f(g^2) \\ f(g^3) \end{pmatrix} \qquad (8.18)$$

This matrix equation is in fact the discrete Fourier transform written explicitly as a 
matrix. These considerations apply to the continuous case as well. If we start with 
the group of shifts on the real line, construct its character table, and define a 
multiplication like that in Equation (8.18) above, we obtain the continuous Fourier 
transform. 
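
The construction in this example can be sketched in a few lines of code. The snippet below builds the character table of the four-element cyclic group and applies it to a function on the group; the sample values in `f`, the sign convention of the phase, and the 1/N normalization of the inverse are assumptions made for illustration.

```python
import cmath

N = 4  # number of points being cyclically shifted

def character(k, j):
    """rho_k(g^j) = exp(2 pi i j k / N), the phase assigned to the shift g^j."""
    return cmath.exp(2j * cmath.pi * j * k / N)

# Character table: one row per representation k, one column per group element g^j.
table = [[character(k, j) for j in range(N)] for k in range(N)]

def fourier(f):
    """Multiply the character table into a function on the group."""
    return [sum(table[k][j] * f[j] for j in range(N)) for k in range(N)]

def inverse_fourier(fhat):
    """Invert using the conjugate table, with an assumed 1/N normalization."""
    return [sum(table[k][j].conjugate() * fhat[k] for k in range(N)) / N
            for j in range(N)]

f = [1, 2, 3, 4]            # a function on the group, listed as (f(e), f(g), f(g^2), f(g^3))
fhat = fourier(f)           # a function on the set of irreducible representations
recovered = inverse_fourier(fhat)
print(fhat)
print(recovered)
```

The rows of `table` are mutually orthogonal, which is why the conjugate table inverts the transform.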

It turns out that this type of construction can be extended to the case of 
non-commutative groups. Consider a finite, but non-commutative group $G$. For 
this group we can construct the dual, $\hat{G}$, which is the set of all equivalence classes 
of irreducible representations. Since the group is non-commutative, some of these 
representations will be matrix representations. However, we can still list them in a 
table like the one above. All groups have the trivial representation where everything 
maps to multiplication by 1. Call that representation $\rho_0$ and put it on the first row 
of the table. The rest of the representations $\rho_i$ are $n_i \times n_i$ matrices. So, the rows of 
the table will be filled with matrices. Each row will have matrices which are all the 
same size, but different rows will have different size matrices. 

              e                  g_1          g_2          g_3
    ρ_0       1                  1            1            1             } 1 × 1
    ρ_1       id_{n_1 × n_1}     ρ_1(g_1)     ρ_1(g_2)     ρ_1(g_3)      } n_1 × n_1
    ρ_2       id_{n_2 × n_2}     ρ_2(g_1)     ρ_2(g_2)     ρ_2(g_3)      } n_2 × n_2
    ρ_3       id_{n_3 × n_3}     ρ_3(g_1)     ρ_3(g_2)     ρ_3(g_3)      } n_3 × n_3
                                                                          (8.19)



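In the commutative case, this table had an obvious interpretation as a matrix, which 
then multiplied a function over the group considered as a set. Now this is not a 
matrix, but it can still be "multiplied" by a function on the group. 

$$\begin{pmatrix} \hat{f}(\rho_0) \\ \hat{f}(\rho_1) \\ \hat{f}(\rho_2) \\ \hat{f}(\rho_3) \end{pmatrix} = \begin{pmatrix} 1 & 1 & 1 & 1 \\ \mathrm{id}_{n_1 \times n_1} & \rho_1(g_1) & \rho_1(g_2) & \rho_1(g_3) \\ \mathrm{id}_{n_2 \times n_2} & \rho_2(g_1) & \rho_2(g_2) & \rho_2(g_3) \\ \mathrm{id}_{n_3 \times n_3} & \rho_3(g_1) & \rho_3(g_2) & \rho_3(g_3) \end{pmatrix} \begin{pmatrix} f(e) \\ f(g_1) \\ f(g_2) \\ f(g_3) \end{pmatrix} \qquad (8.20)$$

The entries on the left-hand side have sizes $1 \times 1$, $n_1 \times n_1$, $n_2 \times n_2$, and $n_3 \times n_3$, 
respectively. 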



This equation is meant to be interpreted in a way analogous to matrix multiplication. 
The "vector" on the left hand side is actually a list of matrices of different sizes. An 
element of this list is computed by summing over the matrices in the same row 
in the table of representations, with the weights of the matrices in the sum given 
by the appropriate element of the vector $f(g)$. This computation is used as the 
definition of the Fourier transform for a non-commutative group. 

$$\hat{f}(\rho) = \sum_{g \in G} f(g) \rho(g) \qquad (8.21)$$
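
To make Equation (8.21) concrete, the sketch below evaluates it for the permutation group of three objects, which is the smallest non-commutative group. This group is not discussed in the text and is chosen here only as an illustration; the particular (non-unitary, integer-valued) basis used for the two-dimensional representation and the sample function `f` are assumptions.

```python
from itertools import permutations

# The six elements of the permutation group S_3, each stored as a tuple g with i -> g[i].
G = list(permutations(range(3)))

def sign(g):
    """Sign of a permutation, computed from its number of inversions."""
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if g[i] > g[j])
    return -1 if inversions % 2 else 1

def rho_trivial(g):
    return [[1]]

def rho_sign(g):
    return [[sign(g)]]

def rho_standard(g):
    """2 x 2 representation on the sum-zero plane, in the basis b1 = e0 - e1, b2 = e1 - e2."""
    columns = []
    for b in ((1, -1, 0), (0, 1, -1)):
        v = [0, 0, 0]
        for i in range(3):
            v[g[i]] += b[i]                    # permutation matrix sends e_i to e_{g(i)}
        columns.append((v[0], v[0] + v[1]))    # coordinates of v in the basis (b1, b2)
    return [[columns[0][0], columns[1][0]],
            [columns[0][1], columns[1][1]]]

DUAL = [rho_trivial, rho_sign, rho_standard]   # one representative per equivalence class

def fourier(f, rho):
    """Equation (8.21): the sum over g of f(g) times the matrix rho(g)."""
    n = len(rho(G[0]))
    out = [[0] * n for _ in range(n)]
    for g in G:
        m = rho(g)
        for r in range(n):
            for c in range(n):
                out[r][c] += f[g] * m[r][c]
    return out

f = {g: index + 1 for index, g in enumerate(G)}   # an arbitrary function on the group
section = [fourier(f, rho) for rho in DUAL]       # a list of matrices of different sizes
for matrix in section:
    print(matrix)
```

The result is a list containing two 1 × 1 matrices and one 2 × 2 matrix, and the squared dimensions add up to the order of the group, 1 + 1 + 4 = 6.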



In the commutative case, this summation resulted in a complex-valued function on 
the set of irreducible representations. In this non-commutative case, we have 
something analogous. However, instead of a function, the Fourier transform is operator 
valued. Not only that, but for each $\rho$, the dimension of the operator is most likely 
going to vary, as seen in Equation (8.20). Such an object is called a section of the 
fiber bundle over the set $\hat{G}$ of irreducible representations. This object is illustrated in 
Figure 8.1. 

The dual fiber bundle is constructed as follows. There is a space of operators 
associated to each point $\rho$ in the dual, $\hat{G}$. This space of operators is $GL(\mathcal{H}_\rho)$, 
where $\mathcal{H}_\rho$ is the Hilbert space on which the representation $\rho$ acts. Put each of these 
spaces of operators over the set $\hat{G}$ as fibers, with the base point of each fiber at 
the appropriate point $\rho$. Even though the fiber is generically a multidimensional 
space, we can think of it in a one-dimensional way, as drawn in Figure 8.1. The 
collection of all of these fibers is the fiber bundle $\Lambda(\hat{G})$. If we now "slice" the fiber 
bundle by choosing one operator from each fiber, this will create a "section" of the fiber 
bundle. A section is a mapping of the dual $\hat{G}$ into the bundle $\Lambda(\hat{G})$. 

$$\hat{f} : \hat{G} \to \Lambda(\hat{G}) \qquad (8.22)$$

FIG. 8.1: A "section" of the dual bundle. 

If we put the elements of $\hat{G}$ into a list, then the section will look like a list of 
operators, each of different dimension. This is exactly the object that we obtain 
from the equation for the Fourier transform of a non-commutative group. 

In the following sections we will define the commutative Fourier transform 
from the group theory perspective. We will then give examples of various types 
of Fourier transforms for commutative groups, showing how the traditional equa- 
tions for Fourier transforms agree with the group theory perspective. We then 
define the Fourier transform for non-commutative groups and give the example of 
the non-commutative Fourier transform for the Heisenberg-Weyl group. 



8.3.2 Pontryagin Theorem 

The Fourier transform on commutative groups was summarized in the 1930's 
by Pontryagin in the following theorem [26]. 

Let $G$ be a locally compact commutative group. Let $\hat{G}$ be its Pontryagin dual 
group defined by 

$$\rho \in \hat{G}, \qquad \rho : G \to \mathbb{T} \qquad (8.23)$$

where $\rho$ is a continuous homomorphism. Since all of the irreducible representations 
of $G$ are one-dimensional, they are all just characters. So the dual group is the set 
of characters of $G$. Let $\mu$ be the Haar measure on $G$ and let $P$ be the Haar measure 
on $\hat{G}$. Consider the Fourier transform operator 

$$\mathcal{F} : L_2(G, d\mu) \to L_2(\hat{G}, dP) \qquad (8.24)$$

defined by 

$$(\mathcal{F}f)(\rho) = \hat{f}(\rho) = \int_G \rho(g) f(g) \, dg \qquad (8.25)$$

This operator has the following properties: 

1. The transform $\mathcal{F}$ is an isometry 

$$\int_G f_1^*(g) f_2(g) \, dg = \int_{\hat{G}} \hat{f}_1^*(\rho) \hat{f}_2(\rho) \, dP \qquad (8.26)$$

2. The inverse of this operator, $\mathcal{F}^{-1}$, is also a Fourier transform 

3. A convolution can be defined using the Fourier transform so that the transform 
of the convolution is the product of the transforms 

$$\widehat{(f_1 * f_2)}(\rho) = \hat{f}_1(\rho) \hat{f}_2(\rho) \qquad (8.27)$$

Using this and the definition of the Fourier transform, we obtain an equation for 
the convolution 

$$(f_1 * f_2)(g) = \int_G f_1(g \circ h^{-1}) f_2(h) \, dh \qquad (8.28)$$
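
The isometry and convolution properties can be checked directly for a finite commutative group. The sketch below uses the integers modulo 5 as an assumed example, with the counting measure on the group and a measure of weight 1/n on each character; the sample functions `f1` and `f2` are arbitrary illustrative values.

```python
import cmath

def fourier(f):
    """Transform on the cyclic group Z_n: sum over x of exp(2 pi i k x / n) f(x)."""
    n = len(f)
    return [sum(cmath.exp(2j * cmath.pi * k * x / n) * f[x] for x in range(n))
            for k in range(n)]

def convolve(f1, f2):
    """Equation (8.28) on Z_n: sum over h of f1(g - h) f2(h), indices taken modulo n."""
    n = len(f1)
    return [sum(f1[(g - h) % n] * f2[h] for h in range(n)) for g in range(n)]

f1 = [1, 2, 0, -1, 3]
f2 = [2, -1, 1, 0, 4]
n = len(f1)

f1_hat, f2_hat = fourier(f1), fourier(f2)

# Property 3: the transform of the convolution is the product of the transforms.
lhs = fourier(convolve(f1, f2))
rhs = [f1_hat[k] * f2_hat[k] for k in range(n)]
convolution_error = max(abs(lhs[k] - rhs[k]) for k in range(n))

# Property 1: inner products agree, with weight 1/n on each character.
inner_group = sum(complex(f1[x]).conjugate() * f2[x] for x in range(n))
inner_dual = sum(f1_hat[k].conjugate() * f2_hat[k] for k in range(n)) / n
isometry_error = abs(inner_group - inner_dual)

print(convolution_error, isometry_error)
```

Both errors are at the level of floating point rounding. Because the group is commutative, `convolve(f1, f2)` and `convolve(f2, f1)` also coincide.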



8.3.3 Examples of Fourier Transforms for Commutative Groups 

The Classical Fourier Transform 

The classical Fourier transform is an operator that acts on functions in $L_2(\mathbb{R}, dx)$. 
Here, $\mathbb{R}$ is the group of reals under addition. The Haar measure, $dx$, for this group 
is simply the Lebesgue measure. The Fourier transform operator is 

$$\mathcal{F} : L_2(\mathbb{R}, dx) \to L_2(\mathbb{R}, dx) \qquad (8.29)$$

defined by 

$$\hat{f}(k) = \int e^{-ikx} f(x) \, dx \qquad (8.30)$$

with the appropriate normalization. There are several things to notice about this 
transform. 

This transform is an isometry, which in this case means that it is a unitary 
transformation. This isometry property is commonly known as the Plancherel 
theorem, or the Parseval theorem. For functions $f_1, f_2 \in L_2(\mathbb{R}, dx)$ the Fourier transform 
preserves the inner product of the functions. 

$$\int f_1^*(x) f_2(x) \, dx = \int \hat{f}_1^*(k) \hat{f}_2(k) \, dk \qquad (8.31)$$

Another thing to notice is that the inverse of this operator, $\mathcal{F}^{-1}$, is also a Fourier 
transform. In this case, the space dual to the group $\mathbb{R}$ is also $\mathbb{R}$. This means that 
it also has a Fourier transform, which, up to a sign in the phase, is the inverse 
of the transform in Equation (8.30). This Fourier transform takes functions from 
$L_2(\mathbb{R}, dx)$ back to the same set of functions. It is quite a special situation, since the 
Fourier transform does not usually do this. 

Fourier Series 

The Fourier series is another example of a commutative Fourier transform. It 
can be expressed as the Fourier transform on the torus, which is the group of real 
numbers with addition modulo $2\pi$. 

$$\mathcal{F} : L_2(\mathbb{T}, d\phi) \to L_2(\mathbb{Z}, dN) \qquad (8.32)$$

Here functions on the circle with the Lebesgue measure $d\phi$ are taken to functions on 
the integers with the counting measure $dN$. 

Discrete Fourier Transform 

The finite, discrete Fourier transform also falls into the category of Fourier 
transforms on commutative groups. Here, we use the group of integers with addition 
modulo an integer $n$. The Fourier transform is then 

$$\mathcal{F} : L_2(\mathbb{Z}_n, dN) \to L_2(\hat{\mathbb{Z}}_n, dN) \qquad (8.33)$$

Here we use the notation that $\hat{\mathbb{Z}}_n$ is the set of characters of the group $\mathbb{Z}_n$. It can be 
shown that the characters of $\mathbb{Z}_n$ are given by 

$$\lambda : \mathbb{Z}_n \to \mathbb{T} \qquad (8.34)$$

where 

$$\lambda(x) = e^{2\pi i \lambda x / n}, \qquad \lambda \in \mathbb{Z}_n. \qquad (8.35)$$

Therefore, the set of characters $\hat{\mathbb{Z}}_n$ is also a group, and is in fact the group $\mathbb{Z}_n$. Since 
this group is finite, the Fourier transform can be written in matrix form as 

$$\mathcal{F}_{k,l} = \left[ e^{2\pi i kl/n} \right]_{k,l \in \mathbb{Z}_n} \qquad (8.36)$$

This is the matrix that was obtained in the motivational example for the case where 
$n = 4$, and shown in Equation (8.18). 

An interesting side note concerns the discrete Fourier transform for the group $\mathbb{Z}_n$, 
where $n = 2k$. Consider the homomorphism $\theta : \mathbb{Z}_n \to \mathbb{Z}_n$ given by 

$$\theta x = 2x \bmod k \qquad (8.37)$$

This is a dilation of the group that effectively breaks the group into the even terms 
and the odd terms. The Fourier transform of $\mathbb{Z}_{2k}$ can then be reduced to the Fourier 
transform on $\mathbb{Z}_k$ and the Fourier transform on the group of dilations. This reduction 
is what gives rise to the fast Fourier transform algorithms. An understanding of the 
group theory origins of the FFT algorithms could help lead to the development of 
new types of fast transforms for other groups. 

8.4 Fourier Transforms of Non-commutative Groups 

The Pontryagin theorem nicely defines the Fourier transform for commutative 
groups, but for non-commutative groups the definitions need to be extended, as 
discussed in Section 8.3.1. Instead of mapping functions over the group, considered 
as a set, to functions over the dual, the Fourier transform will now map functions to 
sections of the dual bundle. [See Figure 8.2.] The rest of the theory proceeds in a 
way completely analogous to the commutative theory described in the last section. 

8.4.1 Definitions 

Let $G$ be a locally compact, but non-commutative group. Let $\hat{G}$ be the set of 
equivalence classes of continuous irreducible unitary representations of $G$. So 

$$\rho \in \hat{G}, \qquad \rho : G \to GL(\mathcal{H}_\rho) \qquad (8.38)$$

where $GL(\mathcal{H}_\rho)$ is the set of operators on some Hilbert space $\mathcal{H}_\rho$. For the case 
when $\mathcal{H}_\rho$ is not just a finite-dimensional vector space, we should actually consider 
Hilbert-Schmidt operators $HS(\mathcal{H}_\rho)$. 

FIG. 8.2: The group theoretical basis for the Fourier transform. The Fourier transform 
maps a function over a group, considered as a set, to a section of the dual bundle. If the 
group is commutative, all its irreducible representations are one-dimensional, and so the 
section of the dual bundle can be thought of as an ordinary complex-valued function. 
Otherwise the section is a listing of operators, one for each fiber. 

Now $\hat{G}$ is not a group as it was for commutative $G$, since the tensor product 
of irreducible representations is usually reducible. However, sections of the dual 
bundle are still well defined, as described in Section 8.3.1. 

$$\hat{f}(\rho) \in \Gamma(\Lambda(\hat{G}), dP) : \hat{G} \to HS(\mathcal{H}_\rho) \qquad (8.39)$$

where $HS(\mathcal{H}_\rho)$ is the set of Hilbert-Schmidt operators that act on the Hilbert space 
$\mathcal{H}_\rho$ associated with the element $\rho$ in $\hat{G}$. The norm for a section $\hat{f}$ is defined by 

$$\|\hat{f}\|^2 = \int_{\hat{G}} \mathrm{tr}\left( \hat{f}^\dagger(\rho) \hat{f}(\rho) \right) dP(\rho) \qquad (8.40)$$

The Fourier transform can now be defined as a mapping from functions on the 
group to sections of the dual bundle, with the appropriate measures in each case. 

$$\mathcal{F} : L_2(G, dg) \to \Gamma(\Lambda(\hat{G}), dP) \qquad (8.41)$$

As we might expect from the example in Section 8.3.1 and from the group theory 
interpretation of the commutative Fourier transform, the non-commutative Fourier 
transform is defined as 

$$\hat{f}(\rho) = \int_G \rho(g) f(g) \, dg \qquad (8.42)$$

where now we have that $\hat{f}$ is in $\Gamma(\Lambda(\hat{G}), dP)$. As in the commutative case, this 
mapping is an isometry, it is invertible, and it can be used to define a convolution 
of functions on the group. 

1. The transform is an isometry 

$$\int_G f_1^*(g) f_2(g) \, dg = \int_{\hat{G}} \mathrm{tr}\left( \hat{f}_1^\dagger(\rho) \hat{f}_2(\rho) \right) dP(\rho) \qquad (8.43)$$

2. The inverse of this operator, $\mathcal{F}^{-1}$, is also a Fourier transform, and it is defined as 
follows for $F \in \Gamma(\Lambda(\hat{G}), dP)$, 

$$(\mathcal{F}^{-1}F)(g) = \int_{\hat{G}} \mathrm{tr}\left( \rho^\dagger(g) F(\rho) \right) dP(\rho) \qquad (8.44)$$

3. The convolution is defined just as in the commutative case 

$$(f_1 * f_2)(g) = \int_G f_1(g \circ h^{-1}) f_2(h) \, dh \qquad (8.45)$$

In the commutative case the convolution was commutative, since the group 
elements commute. However, this is no longer true in the non-commutative case. 

$$(f_1 * f_2)(g) \neq (f_2 * f_1)(g) \qquad (8.46)$$

This point will become important later on when we define the symbol of an 
operator, and will give rise to a non-commutative "star" product for functions 
on phase space. 
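
The convolution property and its failure to commute can be seen in a small finite example. The sketch below again uses the permutation group of three objects, which is an assumed illustration and not an example from the text; the two-dimensional representation is written in an integer (non-unitary) basis, and the functions `f1`, `f2` and the elements `s`, `c` are arbitrary choices.

```python
from itertools import permutations

G = list(permutations(range(3)))   # elements of S_3, each a tuple g with i -> g[i]

def compose(g, h):
    """Group product: first apply h, then g."""
    return tuple(g[h[i]] for i in range(3))

def inverse(g):
    inv = [0, 0, 0]
    for i, gi in enumerate(g):
        inv[gi] = i
    return tuple(inv)

def rho(g):
    """2 x 2 representation on the sum-zero plane, basis b1 = e0 - e1, b2 = e1 - e2."""
    columns = []
    for b in ((1, -1, 0), (0, 1, -1)):
        v = [0, 0, 0]
        for i in range(3):
            v[g[i]] += b[i]
        columns.append((v[0], v[0] + v[1]))
    return [[columns[0][0], columns[1][0]], [columns[0][1], columns[1][1]]]

def matmul(a, b):
    return [[sum(a[r][k] * b[k][c] for k in range(2)) for c in range(2)] for r in range(2)]

def fourier(f):
    """Sum over g of f(g) rho(g), for the two-dimensional representation only."""
    out = [[0, 0], [0, 0]]
    for g in G:
        m = rho(g)
        out = [[out[r][c] + f[g] * m[r][c] for c in range(2)] for r in range(2)]
    return out

def convolve(f1, f2):
    """Equation (8.45): sum over h of f1(g o h^{-1}) f2(h)."""
    return {g: sum(f1[compose(g, inverse(h))] * f2[h] for h in G) for g in G}

f1 = {g: index + 1 for index, g in enumerate(G)}
f2 = {g: (3 * index) % 5 for index, g in enumerate(G)}
theorem_holds = fourier(convolve(f1, f2)) == matmul(fourier(f1), fourier(f2))

# Delta functions at a transposition s and a 3-cycle c show the non-commutativity.
s, c = (1, 0, 2), (1, 2, 0)
delta_s = {g: int(g == s) for g in G}
delta_c = {g: int(g == c) for g in G}
commutes = convolve(delta_s, delta_c) == convolve(delta_c, delta_s)

print(theorem_holds, commutes)
```

The transform of the convolution equals the matrix product of the transforms, and since matrix products do not commute, neither does the convolution.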
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8.4.2 Fourier Transform on the 3-dimensional Heisenberg- 
Weyl Group 

The 3-dimensional Heisenberg-Weyl (HW) group is defined as in Section 

$$\mathfrak{H} = \{ (z, \lambda) \,|\, z \in \mathbb{R}^2, \lambda \in \mathbb{R} \} \qquad (8.47)$$

The measure $dg$ on this group is the Lebesgue measure $dg = d^2z \, d\lambda$. The set of 
irreducible representations, $\hat{\mathfrak{H}}$, includes one-dimensional representations, shown in 
Equation (7.149), as well as a family of infinite-dimensional representations, shown 
in Equation (7.150). So a section of the dual bundle is a collection of operators 
which come in two different types. The part of the section associated with the 
one-dimensional representations is a mapping of the dual $\hat{G}$ to the complex numbers. The 
other part of the section is associated with the infinite-dimensional representations. 
This part maps $\hat{G}$ to operators acting on $L_2(\mathbb{R})$. 

As an illustration of the sorts of calculations that can be done using the non- 
commutative Fourier transform, we can compute the inverse transform of a specific 
operator. Consider the shift operator acting on complex functions of one real 
variable. 

$$(S_{x'} f)(x) = f(x - x') \qquad (8.48)$$

This operator lives in the same Hilbert space as the infinite-dimensional irreducible 
representations of the HW group. So, we can use the Fourier transform of the 
HW group to take the inverse transform of this operator. In order to do this, 
we need to embed the operator in a section of the dual bundle. Since there is a 
family of representations which all live in the space of operators acting on L2(M), 
there are many ways to embed the shift operator into a section. The two simplest 
embeddings would be a constant embedding over all possible representations, and 
a delta function embedding over one specific representation. Figure 8.3 illustrates 
these embeddings with a simple cartoon. 

FIG. 8.3: Possible embeddings of an operator into a section of the dual bundle of the 
Heisenberg-Weyl group. In (a) the operator is assigned to all representations which have 
the right type of fiber, while those with the wrong type of fiber take the value zero. In 
(b) the operator is assigned to only one representation $\rho_1$. All other representations get 
mapped to zero. 

These two example embeddings can be explicitly written out. The constant 
embedding is a section where all of the infinite-dimensional representations get mapped 
to the operator $S_{x'}$, while all of the finite-dimensional representations get mapped 
to zero. 

$$A_1(\rho) = \begin{cases} S_{x'} & \rho = \rho_\alpha, \ \forall \alpha \\ 0 & \rho = \sigma_{(u,v)} \end{cases} \qquad (8.49)$$

The delta function section maps everything to zero, except for one of the 
infinite-dimensional representations, which it maps to the operator $S_{x'}$. To make it definite, 
choose the representation $\rho_1$ to map to $S_{x'}$. 

$$A_2(\rho) = \begin{cases} S_{x'} & \rho = \rho_1 \\ 0 & \text{otherwise} \end{cases} \qquad (8.50)$$



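We can now use the inverse Fourier transform in Equation (8.44) to calculate the 
transform of these sections. Also, we will make use of the fact that the 
representations look like shifts to write 

$$S_{x'} = \rho_\alpha(x', 0, 0). \qquad (8.51)$$

This gives us, with $g = (q, p, \lambda)$, 

$$(\mathcal{F}^{-1}A_1)(g) = \int_{\hat{G}} \mathrm{tr}\left( \rho^\dagger(g) A_1(\rho) \right) dP(\rho) \qquad (8.52)$$
$$= \int_{\hat{G}} \mathrm{tr}\left( \rho_\alpha^\dagger(g) \rho_\alpha(x', 0, 0) \right) dP(\rho) \qquad (8.53)$$
$$= \int_{\hat{G}} \mathrm{tr}\left( \rho_\alpha^\dagger(q + x', p, \lambda) \right) dP(\rho) \qquad (8.54)$$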

': (8.55) 

$$= \mathcal{N} \delta(q - x') \delta(p) \delta(\lambda) \qquad (8.56)$$

So the "constant" section gives a delta function on the group. 

The transform of the alternative embedding can also be calculated. 

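$$(\mathcal{F}^{-1}A_2)(g) = \int_{\hat{G}} \mathrm{tr}\left( \rho^\dagger(g) A_2(\rho) \right) dP(\rho) \qquad (8.57)$$
$$= \int_{\hat{G}} \mathrm{tr}\left( \rho_\alpha^\dagger(g) \rho_\alpha(x', 0, 0) \right) \delta(\alpha - 1) \, dP(\rho) \qquad (8.58)$$
$$= \mathrm{tr}\left( \rho_1^\dagger(q + x', p, \lambda) \right) \qquad (8.59)$$
$$= \chi_{\rho_1}\left( g \circ (x', 0, 0) \right) \qquad (8.60)$$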

: (8.61) 

$$= \mathcal{N} e^{i\lambda} \delta(q - x') \delta(p) \qquad (8.62)$$

So our section which started out localized transforms to a function spread out in $\lambda$. 

8.5 The Non-commutative Fourier Transform and Zobin's Symbol Theory 

The Fourier transform for non-commutative groups provides the mathematical 
basis for the theory of symbols of operators. While it has been understood that ideas 
from group theory play an important role in the theory of symbols, it was Zobin 
who recognized that the mapping from operators to functions could be described as 
a type of double Fourier transform on groups. 

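$$\Gamma(\Lambda(\hat{G}), dP) \xrightarrow{\ \mathcal{F}_G^{-1}\ } L_2(G, dg) \overset{\text{as a set}}{=} L_2(G_0, dg) \xrightarrow{\ \mathcal{F}_{G_0}\ } L_2(\hat{G}_0, d\hat{g}) \qquad (8.63)$$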

The idea of this transformation is this: Start with an operator of interest, $\hat{A}$. This 
operator lives in some space of operators, $HS(\mathcal{H})$. There is often then a 
non-commutative group $G$ which has an irreducible representation which also lives in 
the space $HS(\mathcal{H})$. So, the operator $\hat{A}$ can be embedded in a section of the dual 
bundle, call the section $S_A(\rho)$. We can now use the theory of Fourier transforms for 
non-commutative groups to transform this section of the dual bundle into a function 
on the group, where the group is considered as a set. The inverse Fourier transform 
$\mathcal{F}_G^{-1}$ is the first transform on the way to calculating the symbol of the operator. 
We now have a function on a group. But, this is really just a function on a set, 
since the function itself is perfectly well defined without the group structure. Now 
consider a new group $G_0$. This is the group which has the same set of elements as 
the group $G$. However, instead of the non-commutative group product rule from $G$, 
the group product for $G_0$ is a commutative rule. So the function that was obtained 
from the inverse Fourier transform can actually be thought of as a function on the 
commutative group $G_0$. The commutative version of the Fourier transform for the 
group $G_0$ can now be applied to obtain a function on the dual, $\hat{G}_0$. This function 
on the dual is the Zobin symbol for the section $S_A(\rho)$. 

Since this procedure relates functions to operators, it can be thought of as a 
type of "quantization" mapping. A function is a "classical" object, and this double 
Fourier transform lets us find a corresponding operator, or "quantum" object. We 
will sometimes refer to this double transform as a quantization, and write it as 

$$Q = \mathcal{F}_G \circ \mathcal{F}_{G_0}^{-1} : L_2(\hat{G}_0, d\hat{g}) \to \Gamma(\Lambda(\hat{G}), dP). \qquad (8.64)$$

The Zobin symbol of an operator is then just the application of the inverse of this 
quantization mapping. For an operator $\hat{A}$ embedded in a section $S_A(\rho)$, write 

$$A(g) = (Q^{-1} S_A)(g) \qquad (8.65)$$

for the symbol of A. 

8.5.1 The Zobin Symbol Theory for the Heisenberg-Weyl 
Group 

A straightforward and useful example of the Zobin transform is the case of the 
double Fourier transform for the continuous Heisenberg-Weyl group. As a set, this 
group is composed of a 2n-dimensional phase space, and one additional dimension 
which is used to record the canonical commutation relations for the elements of 
phase space. 

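$$\mathfrak{H} \overset{\text{as a set}}{=} \mathbb{R}^{2n+1} \qquad (8.66)$$

As described in Section 7.7.2, the group product law is almost just the commutative 
addition of elements of $\mathbb{R}^{2n+1}$. 

$$g_1 \circ g_2 = \left( z_1 + z_2, \ \lambda_1 + \lambda_2 + \frac{1}{2}\omega(z_1, z_2) \right) \qquad (8.67)$$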
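
The group law can be explored with a short sketch for a two-dimensional phase space. The factor of 1/2 in front of the symplectic form and the sign convention of `omega` (taken as q1 p2 - p1 q2) are assumptions, since the conventions of Section 7.7.2 are not reproduced in this chapter; the sample elements are arbitrary.

```python
def omega(z1, z2):
    """Assumed symplectic form on R^2: omega((q1, p1), (q2, p2)) = q1*p2 - p1*q2."""
    (q1, p1), (q2, p2) = z1, z2
    return q1 * p2 - p1 * q2

def product(g1, g2):
    """Group product: add the phase space points, and shift lambda by omega/2."""
    (z1, lam1), (z2, lam2) = g1, g2
    z = (z1[0] + z2[0], z1[1] + z2[1])
    return (z, lam1 + lam2 + 0.5 * omega(z1, z2))

def inverse(g):
    (q, p), lam = g
    return ((-q, -p), -lam)

g1 = ((1.0, 2.0), 0.5)
g2 = ((-3.0, 0.25), 1.0)
g3 = ((0.5, -1.5), 2.0)

# The product is associative but not commutative.
associative = product(product(g1, g2), g3) == product(g1, product(g2, g3))
commutative = product(g1, g2) == product(g2, g1)

# The commutator g1 g2 g1^{-1} g2^{-1} has no phase space part: it only shifts lambda.
commutator = product(product(g1, g2), product(inverse(g1), inverse(g2)))

print(associative, commutative, commutator)
```

The commutator lies entirely along the extra dimension, with value `omega(z1, z2)`, which is the sense in which that dimension records the commutation relations. Dropping the `omega` term gives the commutative group of ordinary addition on the same set.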

The set $\mathbb{R}^{2n+1}$ also has a natural commutative group structure, that of ordinary 
addition. So, the double Fourier transform will be an inverse transform from the 
dual to the Heisenberg-Weyl group, followed by an ordinary $(2n+1)$-dimensional 
Fourier transformation. 

$$\Gamma(\Lambda(\hat{\mathfrak{H}}), dP) \xrightarrow{\ \mathcal{F}_{\mathfrak{H}}^{-1}\ } L_2(\mathfrak{H}, dg) \overset{\text{as a set}}{=} L_2(\mathbb{R}^{2n+1}, d^{2n+1}z) \xrightarrow{\ \mathcal{F}\ } L_2(\hat{\mathbb{R}}^{2n+1}, d^{2n+1}\omega) \qquad (8.68)$$

The irreducible representations of the Heisenberg-Weyl group are given in 
Equations (7.149) and (7.150). There, a splitting of phase space has been chosen, which 
specifies the "position" and "momentum" coordinates in $\mathbb{R}^{2n}$. Since there are two 
types of representations, the dual space $\hat{\mathfrak{H}}$ has two components. The first, which is 
composed of the one-dimensional representations from Equation (7.149), turns out 
to have zero Plancherel measure, and so we can neglect it when forming sections of 
the dual bundle for transforming. The second component of the dual is given by the 
one parameter family of infinite-dimensional representations in Equation (7.150), 
reproduced here for convenience. 

$$\rho_\alpha(q', p', \lambda)\psi(q) = e^{i\alpha\left( \lambda - \frac{1}{2} p' \cdot q' - p' \cdot q \right)} \psi(q + q') \qquad (8.69)$$

Since these representations are parametrized by real, nonzero $\alpha$, the dual is 
isomorphic as a set to the real line $\mathbb{R} \setminus \{0\}$. These representations are operators which 
act on functions on configuration space, so for each $\alpha$ the fiber is the space of 
operators $HS(L_2(\mathbb{R}^n))$. A section of the dual bundle is therefore, up to the set 
of measure zero associated with the one-dimensional representations, a choice of 
Hilbert-Schmidt operators, one for each $\alpha \neq 0$. 

8.5.2 The Weyl Symbol as a special case of the Zobin Transform 

The Weyl symbol of an operator can be thought of as a special case of the 
Zobin transform for the Heisenberg-Weyl group. If the operator is embedded in a 
specific section of the Heisenberg-Weyl dual, then its transform will be restricted to 
a $2n$-dimensional space which can be thought of as phase space. This relationship 
between the Weyl symbol and the double non-commutative Fourier transform can 
be demonstrated by an explicit calculation, performed as follows. 

Take an operator of interest, $\hat{A}$, and embed it in a section of the dual bundle, 
as in Equation (8.50). 

$$A_2(\rho) = \begin{cases} \hat{A} & \text{if } \alpha = 1 \\ 0 & \text{otherwise} \end{cases} \qquad (8.70)$$

With this embedding the section is only non-zero for the representation with $\alpha = 1$, 
which in the $q$ representation, is the operator given in Equation (7.150). 

$$\rho_1(q', p', \lambda')\psi(q) = e^{i\left( \lambda' - \frac{1}{2} p' \cdot q' - p' \cdot q \right)} \psi(q + q') \qquad (8.71)$$

Then, the non-commutative inverse Fourier transform of $A_2(\rho)$ is 

$$A(g') = \int_{\hat{G}} \mathrm{tr}\left( \rho^\dagger(g') A_2(\rho) \right) dP(\rho) = \mathrm{tr}\left( \rho_1^\dagger(g') \hat{A} \right) \qquad (8.72)$$

The symbol of the section $A_2(\rho)$ is now given by the commutative, $(2n+1)$-dimensional 
Fourier transform of $A(g')$. We think of the elements $g'$ of the group as 
simply points in $(2n+1)$-dimensional space, and write the Fourier transform in such 
a way as to be invariant under changes of splitting. Also, the trace in $A(g')$ can be 
explicitly written as an integral in the $q$ representation. 

$$A(z, \lambda) = \int_G d^{2n}z' \, d\lambda' \, e^{i\omega(z, z') + i\lambda\lambda'} A(z', \lambda') \qquad (8.73)$$
$$= \int_G d^{2n}z' \, d\lambda' \, e^{i\omega(z, z') + i\lambda\lambda'} \, \mathrm{tr}\left( \rho_1^\dagger(z', \lambda') \hat{A} \right) \qquad (8.74)$$
$$= \int e^{i(\lambda - 1)\lambda'} d\lambda' \int e^{i\left( q \cdot p' - p \cdot q' + \frac{1}{2} p' \cdot q' + p' \cdot q'' \right)} \langle q'' + q' | \hat{A} | q'' \rangle \, d^n p' \, d^n q' \, d^n q'' \qquad (8.75)$$
$$= \delta(\lambda - 1) \int \delta\left( q + \tfrac{1}{2} q' + q'' \right) e^{-i p \cdot q'} \langle q'' + q' | \hat{A} | q'' \rangle \, d^n q'' \, d^n q' \qquad (8.76)$$
$$= \delta(\lambda - 1) \int e^{-i p \cdot q'} \langle -q + q'/2 | \hat{A} | -q - q'/2 \rangle \, d^n q' \qquad (8.77)$$

From this we see that, up to an inversion of the $q$ axis, the symbol as calculated by 
using the double Fourier transform agrees with the definition of the Weyl symbol 
[Equation (8.4)] on the plane $\lambda = 1$. 

$$A_{\text{symbol}}(-q, p, 1) = A_{\text{Weyl}}(q, p) \qquad (8.78)$$

So the theory of Weyl symbols can be reinterpreted from the point of view of Zobin's 
double non-commutative Fourier transform for the Heisenberg-Weyl group. Also, 
the Moyal star product for symbols of operators is simply the convolution for the 
non-commutative Fourier transform on the Heisenberg-Weyl group. 

8.5.3 The Star Product 

In the previous sections the Zobin symbol of an operator was defined, using
the double Fourier transform for non-commutative groups [Equation (8.63)]. This
works well for finding the symbol of one operator. Often, a particular calculation will
involve products of operators. In this case, the group theory point of view becomes
very useful. The non-commutative Fourier transform of a product can be found
using the convolution theorem [Equation (8.28)]. Application of the commutative
Fourier transform then gives an equation for the symbol of a product. This symbol
can be used to define a generalization of the Moyal star product for symbols.
The convolution theorem gives the transform of a product of sections.

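\[ S_1(\rho)\,S_2(\rho) \longmapsto (f_1 * f_2)(g) = \int_G f_1(g \circ h_2^{-1})\, f_2(h_2)\; dh_2 \tag{8.79} \]

Here the functions f_1(g) and f_2(g) are the inverse transforms of the sections S_1(ρ)
and S_2(ρ), respectively. Application of the commutative Fourier transform gives us
the symbol, which can then be used to define the star product, denoted ★, of two
symbols.

\[ S_1(\rho)\,S_2(\rho) \longmapsto (\hat{f}_1 \star \hat{f}_2)(\tau) = \int_G\!\int_G \tau(h)\, f_1(h \circ h_2^{-1})\, f_2(h_2)\; dh\, dh_2 \tag{8.80} \]

\[ = \int_G\!\int_G \tau(h_1 \circ h_2)\, f_1(h_1)\, f_2(h_2)\; dh_1\, dh_2 \tag{8.81} \]

Here \hat{f}_1(τ) and \hat{f}_2(τ) are the symbols of S_1(ρ) and S_2(ρ).

\[ \hat{f}_j(\tau) = \int_{G_0} \tau(h)\, f_j(h)\; dh \tag{8.82} \]

The argument τ is one of the irreducible representations of the commutative group
G_0. We can insert the inverse of this relationship into Equation (8.81) in order to have
a definition of the star product that involves \hat{f}_j(τ) instead of f_j(h).

\[ (\hat{f}_1 \star \hat{f}_2)(\tau) = \int_G\!\int_G \tau(h_1 \circ h_2) \left( \int_{\hat{G}_0} \tau_1^\dagger(h_1)\, \hat{f}_1(\tau_1)\, d\tau_1 \right) \left( \int_{\hat{G}_0} \tau_2^\dagger(h_2)\, \hat{f}_2(\tau_2)\, d\tau_2 \right) dh_1\, dh_2 \tag{8.83} \]

\[ = \int_{\hat{G}_0}\!\int_{\hat{G}_0} \hat{f}_1(\tau_1)\, \hat{f}_2(\tau_2) \left( \int_G\!\int_G \tau(h_1 \circ h_2)\, \tau_1^\dagger(h_1)\, \tau_2^\dagger(h_2)\; dh_1\, dh_2 \right) d\tau_1\, d\tau_2 \tag{8.84} \]

\[ = \int_{\hat{G}_0}\!\int_{\hat{G}_0} \hat{f}_1(\tau_1)\, \hat{f}_2(\tau_2)\, K_\star(\tau, \tau_1, \tau_2)\; d\tau_1\, d\tau_2 \tag{8.85} \]

In the last line, the integrals over the group have been combined, and used to define
the integral kernel K_★(τ, τ_1, τ_2) of the star product.

\[ K_\star(\tau, \tau_1, \tau_2) = \int_G\!\int_G \tau(h_1 \circ h_2)\, \tau_1^\dagger(h_1)\, \tau_2^\dagger(h_2)\; dh_1\, dh_2 \tag{8.86} \]

This kernel form of the star product is useful when evaluating the star product of
more than two terms, e.g., for finding (\hat{f}_1 ★ \hat{f}_2 ★ \hat{f}_3)(τ) in terms of the symbols \hat{f}_j(τ).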

If the group multiplication ∘ were commutative, then from Equation (8.81) we
can see that the star product would reduce to the ordinary product of the symbols.

\[ (\hat{f}_1 \star \hat{f}_2)(\tau) = \int_G\!\int_G \tau(h_1)\, \tau(h_2)\, f_1(h_1)\, f_2(h_2)\; dh_1\, dh_2 \tag{8.87} \]

\[ = \left( \int_G \tau(h_1)\, f_1(h_1)\; dh_1 \right) \left( \int_G \tau(h_2)\, f_2(h_2)\; dh_2 \right) \tag{8.88} \]

But the group product is not commutative, and so the integral fails to simplify to an
ordinary product. In the case when G is the Heisenberg-Weyl group, we can write
h_j in component form as h_j = (z_j, λ_j), and the product as

\[ h_1 \circ h_2 = h_1 + h_2 + \left(0, \tfrac{1}{2}\omega(z_1, z_2)\right). \tag{8.89} \]

Each of these terms is an element of the commutative group G_0, and so the
representation τ can be separated.

\[ \tau(h_1 \circ h_2) = \tau(h_1)\, \tau(h_2)\, \tau\left(0, \tfrac{1}{2}\omega(z_1, z_2)\right). \tag{8.90} \]

The integral which defines the star product is then left with an extra phase, since
we can write τ as an exponential, τ(h) = exp(i⟨α_τ, h⟩).

\[ (\hat{f}_1 \star \hat{f}_2)(\tau) = \int_G\!\int_G e^{i\langle \alpha_\tau,\, (0,\, \frac{1}{2}\omega(z_1, z_2)) \rangle}\, \tau(h_1)\, f_1(h_1)\, \tau(h_2)\, f_2(h_2)\; dh_1\, dh_2 \tag{8.91} \]

The phase in the exponent is what gives the star product its non-commutativity, and
as we will see later, will provide the ∫(q dp − p dq) term in the phase space path
integral which gives rise to Hamilton's equations in the semiclassical limit.

The structure of this star product is why the double Fourier transform is used
to define the symbol of an operator. If the non-commutativity of G depends on some
small parameter, then, as the parameter goes to zero, the star multiplication of the
symbols becomes ordinary multiplication of functions. This will find direct application
in the semiclassical limit of quantum mechanics, and in the small wavelength
limit of more generic wave equations.



8.6 The "Symbol" of a Matrix 

The non-commutative double Fourier transform lets one associate a complex
scalar function with a given operator. Here we show how this can be done for the
example of an n × n matrix. The matrix is embedded in a section of the dual bundle
of the discrete Heisenberg-Weyl group ℌ_n, and two group Fourier transforms are
applied to find the symbol of the matrix.

\[ \Gamma\left(\Lambda(\hat{\mathfrak{H}}_n), dP\right) \longrightarrow L^2(\mathfrak{H}_n, dg) \;\text{``=''}\; L^2(\mathbb{Z}^3/n, dN) \longrightarrow L^2(\hat{\mathbb{Z}}^3/n, dN) \tag{8.92} \]

The first, inverse, Fourier transform takes the section of the dual bundle to a function
on the group. But the group is, as a set, just a lattice of points. So the second Fourier
transform is an ordinary discrete Fourier transform.

In this section, the symbols of specific matrices are calculated, and compared
with the intuition developed by considering the Wigner function. Whenever
possible, the dimension n of the matrix will be left arbitrary. Specific examples and
calculations will be performed for the case n = 3.

8.6.1 Irreducible Representations of ℌ_n

The group theory for the discrete Heisenberg-Weyl group ℌ_n is described in
Section 7.7.1. The group consists of n³ points labeled by integers mod n,

\[ \mathfrak{H}_n = \{ g = (q, p, \lambda) \in \mathbb{Z}/n \oplus \mathbb{Z}/n \oplus \mathbb{Z}/n \}, \tag{8.93} \]

and so, as a set, this group is equivalent to the integer lattice ℤ³/n.

This group has n² inequivalent one-dimensional irreducible representations, and
n − 1 inequivalent n-dimensional irreducible representations. The one-dimensional
irreducible representations are labeled by a pair of integers in ℤ²/n.

The n-dimensional irreducible representations involve shifts and multiplication by a
phase. In matrix representation, these can be written as



where a ∈ ℤ_n∖0, S is the shift matrix, and T is the diagonal matrix of nth roots of
unity, as in Equation (7.84).

Given these irreducible representations of ℌ_n we can construct the dual, ℌ̂_n.
The dual has n² + n − 1 elements, one for each irreducible representation. The
n² one-dimensional representations are multiplications by a complex phase, so their
associated fibers are just the space of complex numbers, ℂ. The n − 1 n-dimensional
representations are given by matrices, so their fibers are the space of n × n matrices,
GL(n).
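As a concrete illustration of the two building blocks named above, the following minimal sketch constructs a shift matrix S and a diagonal matrix T of nth roots of unity for n = 3 and checks the relation T S = w S T, with w = exp(2πi/n). The index conventions (which way S shifts, and the ordering of the roots on the diagonal of T) are assumptions made for this example; the conventions of Equation (7.84) may differ by a transpose or an inverse. The helper names are hypothetical.

```python
import cmath

def shift_matrix(n):
    """S[j][k] = 1 when j = k + 1 (mod n): a cyclic shift of the basis vectors."""
    return [[1 if j == (k + 1) % n else 0 for k in range(n)] for j in range(n)]

def phase_matrix(n):
    """T = diag(1, w, w^2, ...), with w = exp(2*pi*i/n) (assumed ordering)."""
    w = cmath.exp(2j * cmath.pi / n)
    return [[w ** j if j == k else 0 for k in range(n)] for j in range(n)]

def matmul(a, b):
    size = len(a)
    return [[sum(a[i][m] * b[m][k] for m in range(size)) for k in range(size)]
            for i in range(size)]

def matpow(a, exponent):
    size = len(a)
    out = [[1 if i == k else 0 for k in range(size)] for i in range(size)]
    for _ in range(exponent):
        out = matmul(out, a)
    return out

def max_diff(a, b):
    """Largest absolute difference between corresponding matrix entries."""
    return max(abs(x - y) for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))

n = 3
S, T = shift_matrix(n), phase_matrix(n)
w = cmath.exp(2j * cmath.pi / n)

# Weyl-type commutation relation T S = w S T under the conventions above.
commutation_error = max_diff(matmul(T, S), [[w * x for x in row] for row in matmul(S, T)])
identity = matpow(S, 0)
```

With these conventions both S and T have order n, which is the matrix counterpart of labeling the group elements by integers mod n.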



8.6.2 Sections of the dual bundle 

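A section of the dual fiber bundle is an assignment of an operator to each point
in ℌ̂_n. For n = 3, this means picking two 3 × 3 matrices, and nine complex numbers.

\[ S = \begin{pmatrix} \rho_1 \mapsto A \\ \rho_2 \mapsto B \\ Q_{0,0} \mapsto C_{0,0} \\ Q_{0,1} \mapsto C_{0,1} \\ \vdots \end{pmatrix} \tag{8.96} \]

Since we are interested in the symbol of a single matrix, we will consider sections of
the form

\[ S_A = \begin{pmatrix} \rho_1 \mapsto A \\ \rho_2 \mapsto 0 \\ Q_{0,0} \mapsto 0 \\ Q_{0,1} \mapsto 0 \\ \vdots \end{pmatrix} \tag{8.97} \]

where A is the matrix of interest.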



8.6.3 Representations of the commutative group ℤ³/n

In order to construct the symbol of a matrix, we will need the representations
of the commutative group that is associated with ℌ_n. This is the group formed by
the finite lattice ℤ³/n = ℤ/n ⊕ ℤ/n ⊕ ℤ/n. The group product law for this group is
normal addition mod n, so the irreducible representations are all one-dimensional.
They are labeled by three integers (u, v, w) ∈ ℤ³/n, and are given by

\[ \tau_{u,v,w}(q, p, \lambda) = \exp\left( \frac{2\pi i}{n} (uq + vp + w\lambda) \right). \tag{8.98} \]

Since these are labeled by triples in ℤ³/n, the dual, ℤ̂³/n, is isomorphic as a set to
ℤ³/n. Also, because the representations are all one-dimensional, the fibers are all
just GL(1) = ℂ.
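The representations in Equation (8.98) are easy to tabulate. The sketch below, which assumes nothing beyond that formula, evaluates τ_{u,v,w} on the lattice for n = 3 and checks two standard properties: the homomorphism property under addition mod n, and orthogonality of distinct representations when summed over the lattice. The function names are illustrative only.

```python
import cmath
from itertools import product

def tau(label, g, n):
    """tau_{u,v,w}(q, p, lam) = exp(2*pi*i*(u*q + v*p + w*lam)/n), as in Eq. (8.98)."""
    u, v, w = label
    q, p, lam = g
    return cmath.exp(2j * cmath.pi * (u * q + v * p + w * lam) / n)

def inner(label_a, label_b, n):
    """Sum over all lattice points of tau_a(g) * conj(tau_b(g))."""
    return sum(tau(label_a, g, n) * tau(label_b, g, n).conjugate()
               for g in product(range(n), repeat=3))

def add_mod(g1, g2, n):
    return tuple((a + b) % n for a, b in zip(g1, g2))

n = 3
same = inner((1, 2, 0), (1, 2, 0), n)        # equals the number of lattice points
different = inner((1, 2, 0), (1, 2, 1), n)   # distinct labels are orthogonal

g1, g2, label = (1, 2, 2), (2, 2, 1), (1, 0, 2)
homomorphism_error = abs(tau(label, add_mod(g1, g2, n), n)
                         - tau(label, g1, n) * tau(label, g2, n))
```

Orthogonality of this kind is what turns sums over the lattice into Kronecker delta functions in the calculations that follow.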

8.6.4 The symbol of a section 

We can now calculate the symbol of the section S_A given in Equation (8.97).
The first step in the double transform given in Equation (8.92) is to perform the
inverse transform of the section to a function on the group. The second step is to
perform a discrete Fourier transform. Together, these transforms define the symbol
of the section S_A(ρ).

\[ A(\tau) = \sum_{g \in \mathfrak{H}_n} \sum_{\rho \in \hat{\mathfrak{H}}_n} \nu_\rho\, \mathrm{tr}\left( S_A(\rho)\, \rho^\dagger(g) \right) \tau(g) \tag{8.99} \]

Here ν_ρ is a normalization (the Plancherel measure) associated with each
representation, ν_ρ = 1/dim(ρ). The inverse of this mapping can be used to reconstruct a
section from a given symbol.

\[ S_A(\rho) = \sum_{g \in \mathfrak{H}_n} \sum_{\tau} A(\tau)\, \tau^\dagger(g)\, \rho(g) \tag{8.100} \]

These equations can now be used to calculate specific examples.

8.6.5 Delta functions in the symbols 

From the equations for the irreducible representations of ℌ_n and ℤ³/n, we can
derive some useful general properties of the symbol of a section. In the case that
n = 3, each of the irreducible representations of ℌ_n is associated with a different
subspace of ℤ̂³/n. In particular, with some algebra the following can be derived.

If the section is only nonzero for the one-dimensional irreducible representations,
then

\[ A(\tau) = A(u, v, w) = \sum_{g} \sum_{a,b} S_A(Q_{a,b})\, Q_{a,b}^\dagger(g)\, \tau_{u,v,w}(g) \tag{8.101} \]

\[ = \sum_{g} \sum_{a,b} C_{a,b} \exp\left( \frac{2\pi i}{n} (-aq - bp + uq + vp + w\lambda) \right) \tag{8.102} \]

\[ = \sum_{a,b} C_{a,b}\, \delta_{u,a}\, \delta_{v,b}\, \delta_{w,0} = C_{u,v}\, \delta_{w,0} \tag{8.103} \]

If the section is only nonzero for one of the n-dimensional irreducible
representations ρ_a with a ≠ 0, then

\[ A(u, v, w) = \sum_{g} \nu_{\rho_a}\, \mathrm{tr}\left( S_A(\rho_a)\, \rho_a^\dagger(g) \right) \tau_{u,v,w}(g) \tag{8.104} \]

= J2 ^p.tr (^A(Pa)T2"^S'') exp ( — {-a\ - aqp + uq + vp + w\)j 

(8.105) 

= X ^ VpJ,Y [SaW)^''"'^'') exp ('^{-aqp + uq + vp)j 

q,p ^ 

(8.106) 

This calculation shows that the different values of w in the symbol correspond
to different components of the section S_A. To each value of w there is associated a
plane in the space ℤ̂³/n which looks like a phase space. Sections which look like delta
functions over ℌ̂_n transform into symbols constrained to one of these "phase space"
components of ℤ̂³/n. This is a specific example of the calculation described in Section
8.5.2, where it was shown how the Weyl symbol can be calculated by transforming
a particular embedding of an operator.

8.6.6 Example: Wigner function analogy

It is instructive to calculate the symbol for matrices that are constructed from
the outer product of a vector with itself. Such a matrix is a projection matrix, and
is the discrete version of the quantum mechanical projection operator. The Weyl
symbol of the quantum projection operator is the Wigner function, so we might
expect the symbol of this projection matrix to behave like the Wigner function of a
quantum state.

In particular, the Wigner function of a position eigenstate is a delta function
in the position coordinate, and a constant in momentum. The discrete version of a
position eigenstate is a vector with only one nonzero element, e.g. (1,0,0). Figure
8.4 shows that the symbol of this discrete version is also a delta function in position
and constant in momentum.

Similarly, the Wigner function of a plane wave (a momentum eigenstate) is a
delta function in momentum and a constant in position. This is exactly what we
see in Figure 8.5, where the symbol of the projector onto a discrete version of the
plane wave is shown.
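The qualitative behavior described above can be reproduced with a simplified calculation. The sketch below is not the full double transform of Equation (8.99): it assumes the representation acts as T^p S^q (with the central phase omitted), forms the function f(q, p) = tr(P (T^p S^q)†) for a projector P = |ψ⟩⟨ψ|, and then applies an ordinary two-dimensional discrete Fourier transform. Under these assumed conventions the result for a basis vector is concentrated on a single line of the 3 × 3 lattice and constant along it, and the result for a discrete plane wave is concentrated on a line in the perpendicular direction. Which axis is called "position" depends on conventions that are not fixed here.

```python
import cmath

def group_function(psi, n):
    """f(q, p) = conj(<psi| T^p S^q |psi>) = tr(P (T^p S^q)^dagger), P = |psi><psi|.

    Assumed action: (S^q psi)_j = psi_{j-q}; T^p multiplies component j by w^(p*j).
    """
    w = cmath.exp(2j * cmath.pi / n)
    f = {}
    for q in range(n):
        for p in range(n):
            amp = sum(psi[j].conjugate() * w ** (p * j) * psi[(j - q) % n]
                      for j in range(n))
            f[(q, p)] = amp.conjugate()
    return f

def lattice_symbol(psi, n):
    """Two-dimensional discrete Fourier transform of f(q, p), indexed as [u][v]."""
    w = cmath.exp(2j * cmath.pi / n)
    f = group_function(psi, n)
    return [[sum(f[(q, p)] * w ** (u * q + v * p) for q in range(n) for p in range(n))
             for v in range(n)] for u in range(n)]

n = 3
w = cmath.exp(2j * cmath.pi / n)

basis_vector = [0j, 1 + 0j, 0j]                       # discrete "position eigenstate"
plane_wave = [w ** j / n ** 0.5 for j in range(n)]    # discrete plane wave, k = 1

symbol_basis = lattice_symbol(basis_vector, n)
symbol_wave = lattice_symbol(plane_wave, n)
```

In this sketch `symbol_basis` is nonzero only in the column v = 1 and `symbol_wave` only in the row u = 2, mirroring the delta-function-times-constant structure of the Wigner functions discussed in the text.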

8.6.7 Star Product for Matrices 

When using the symbol calculus, it is often necessary to calculate the symbol
of a product of operators. In the case of symbols of matrices, this calculation can be
done once and for all by finding the kernel of the star product, which will be a discrete
version of Equation (8.86). This kernel can then be used to compute the star product
rule for symbols of matrices.
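A discrete version of the kernel in Equation (8.86) can be sketched directly as a double sum over the group. The example below is a hedged illustration, not the calculation performed for this dissertation: it assumes the discrete group law (q_1, p_1, λ_1) ∘ (q_2, p_2, λ_2) = (q_1 + q_2, p_1 + p_2, λ_1 + λ_2 + q_1 p_2 − p_1 q_2) mod n, uses the representations of Equation (8.98), and omits any normalization. For comparison it also evaluates the same sum with plain addition as the group law, in which case the kernel vanishes unless all three labels coincide and the star product reduces to an ordinary product. All function names are hypothetical.

```python
import cmath
from itertools import product

def elements(n):
    """All (q, p, lam) triples with entries in Z/n."""
    return list(product(range(n), repeat=3))

def hw_product(h1, h2, n):
    """Assumed discrete Heisenberg-Weyl law: componentwise addition mod n, plus a
    symplectic twist q1*p2 - p1*q2 in the central component."""
    q1, p1, l1 = h1
    q2, p2, l2 = h2
    return ((q1 + q2) % n, (p1 + p2) % n, (l1 + l2 + q1 * p2 - p1 * q2) % n)

def abelian_product(h1, h2, n):
    """Plain addition mod n, used as the commutative comparison case."""
    return tuple((a + b) % n for a, b in zip(h1, h2))

def tau(label, h, n):
    u, v, w = label
    q, p, lam = h
    return cmath.exp(2j * cmath.pi * (u * q + v * p + w * lam) / n)

def kernel_element(t, t1, t2, n, law):
    """Unnormalized K(t, t1, t2) = sum_{h1,h2} t(h1 o h2) conj(t1(h1)) conj(t2(h2))."""
    total = 0
    for h1 in elements(n):
        for h2 in elements(n):
            total += (tau(t, law(h1, h2, n), n)
                      * tau(t1, h1, n).conjugate()
                      * tau(t2, h2, n).conjugate())
    return total

n = 3
t = (0, 0, 1)
k_abelian_diag = kernel_element(t, t, t, n, abelian_product)
k_abelian_off = kernel_element(t, (1, 0, 1), (0, 1, 1), n, abelian_product)
k_hw_off = kernel_element(t, (1, 0, 1), (0, 1, 1), n, hw_product)
```

With the commutative law the off-diagonal element vanishes, while with the twisted law it is a pure phase of magnitude n⁴; that phase is the discrete counterpart of the extra exponential in Equation (8.91).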

The discrete version of the star product looks like a contraction of a rank three
tensor.

FIG. 8.4: Symbol of projectors onto the discrete position eigenstates, (1,0,0), (0,1,0),
and (0,0,1). Since the matrix is embedded in a delta-like way as a section of the dual
bundle, the symbol is restricted to a 3 × 3 "phase space". The values of the symbol
on this phase space are shown here, and they resemble discrete versions of the Wigner
function for continuous position eigenstates.

FIG. 8.5: Symbol of projectors onto the discrete plane waves, exp(−2πi qp/3), for p =
0, 1, 2.

Using the equation for the representations τ, and the equation for the kernel, we
can compute the elements of K_★.

This summation can be directly computed, and used to create a numerical algorithm
for calculating the star product. For example, this calculation was performed using
Matlab, and the results can be used to quickly calculate the star product of two
sections. As an example, here is the matrix K_★(τ, τ_1, τ_2) for the particular choice of
τ = (0,0,1).

CHAPTER 9

Functions of Operators and the Path Integral

Introduction 

In this chapter, we describe how to use the Zobin symbol theory to calculate
the symbol for a function of an operator. While functions of operators are generally
important, one particular example stands out, and that is the exponential of an 
operator. In order to connect this section to a problem in physics, we start with 
an example of how the exponential of an operator arises in the context of a generic 
wave equation. We then return to the context of the Zobin symbol theory, and 
describe how the path integral arises when calculating the symbol of a function of 
an operator. 

Consider a generic wave equation in a time-independent medium, for example 
the equation for the electric field in a nonuniform plasma. Such an equation can be 
written in integral form as 



Because this has the form of a convolution in time, we can Fourier transform in time
to obtain

where the frequency now appears as a parameter. The integral kernel D(x, x'; ω)
encodes the physics of the problem, and solutions of this equation describe waves of
frequency ω in the plasma. Alternatively, this equation can be written in
pseudodifferential form, for example by using the Weyl symbol calculus. Abstractly, such
an equation can be written as an operator acting on some state:

\[ D(\omega)\,|\psi\rangle = 0. \tag{9.3} \]

The parameter ω appears because the time dependence has been removed by Fourier
transforming with respect to time. If we are looking for narrow-banded solutions,
then we can expand about ω_0, the carrier frequency, to find:

\[ \left( D(\omega_0) + (\partial_\omega D)(\omega - \omega_0) + \ldots \right) |\psi\rangle = 0. \tag{9.4} \]

If the variation of D(ω) is slow enough, then we can ignore the higher order terms
in this series. Performing an inverse Fourier transform of the truncated series gives
the equation

\[ \left( D(\omega_0) + (\partial_\omega D)(i\partial_t - \omega_0) \right) |\psi\rangle = 0. \tag{9.5} \]

If we write the field as a carrier e^{-iω_0 t} times an envelope, the ω_0 term can be removed:

\[ 0 = \left( D(\omega_0) + (\partial_\omega D)(i\partial_t - \omega_0) \right) e^{-i\omega_0 t}\, |\psi\rangle \tag{9.6} \]

\[ = e^{-i\omega_0 t} \left( D(\omega_0) + (\partial_\omega D)(i\partial_t) \right) |\psi\rangle. \tag{9.7} \]

We can now drop the phase e^{-iω_0 t}. Continuing the analysis of this equation with
a generic operator involves inverting ∂_ω D. However, since this calculation is only
meant to illustrate the usefulness of the exponential of an operator, we make the
simplifying assumption that ∂_ω D is proportional to the identity:

\[ \partial_\omega D = \mathrm{Id}. \tag{9.8} \]

This operator can be inverted, and we can now isolate the time derivative by
multiplying through with the inverse:

\[ 0 = \left( D(\omega_0) + i\partial_t \right) |\psi\rangle. \tag{9.9} \]

Now define a new operator

\[ H = -D(\omega_0). \tag{9.10} \]

With this definition, the equation for the envelope of the wave looks like the Schrödinger
equation.

\[ \left( H - i\partial_t \right) |\psi\rangle = 0. \tag{9.11} \]

So a wave equation with the form of the Schrödinger equation is actually very general.
We can expect evolution equations of this form to arise whenever we consider
the dynamics of the envelope of a narrow-banded wavepacket.

This equation, which has one derivative in time, can be solved by exponentiating
the operator itH.

\[ |\psi(t)\rangle = e^{itH}\, |\psi(0)\rangle \tag{9.12} \]

The exponent of an operator can be computed in two different ways. The first,
which usually only works in special cases, would be to find the eigenvalues and
eigenfunctions of H. If the eigenfunctions are used as a basis, then the operator itH
is diagonal with the eigenvalues on the diagonal. The exponential of the operator
can then be calculated by exponentiating the eigenvalues. This is how we usually
solve quantum problems, where the eigenvalues are related to the energies of the
stationary states. If H is Hermitian, this diagonalization can always be done in
principle, but in practice can be difficult. Also, physical insight might be lacking
which could be gained using other asymptotic methods.

The second option for finding the exponent is to define e^{itH} in terms of a series
involving powers of H. Finite powers are perfectly well defined, but may become
cumbersome to compute for large orders. If instead we can compute the symbol of
the operator and use it to find the symbol of e^{itH}, we would have an alternative
approach to calculating e^{itH}.

\[ \begin{array}{ccc} H & \xrightarrow{\ \text{power series}\ } & e^{itH} \\ Q^{-1} \downarrow & & \uparrow Q \\ \text{Symbol of } H & \xrightarrow{\ \text{path integral}\ } & \text{Symbol of } e^{itH} \end{array} \tag{9.13} \]

As we will see in this chapter, finding the symbol of the exponent requires calculating 
repeated star products. As the number of repeated star products goes to infinity, 
the formula that is obtained is a path integral expression for the symbol of the 
exponent of an operator. While this path integral may be no easier to compute 
than the eigenvalues and eigenfunctions of H, it can still be a useful tool. For 
example, since the phase which appears in the path integral is related to the classical 
action, various methods of semiclassical analysis can be applied to the path integral, 
providing physical insight into the systems being studied. 

In addition to describing how the path integral arises when calculating a function
of an operator, we will discuss two interesting observations about the path
integral. The first is that when calculating the path integral solution for vector
wave equations, the function which arises as the ray Hamiltonian is not one of the
eigenvalues of the dispersion matrix, but rather one of the diagonal elements of the
matrix. This could lead to a new approach for ray-tracing approximations, which
might be able to handle novel types of mode conversion that are not of the familiar
"avoided crossing" form. For example, in multidimensional problems, mode conversion
can occur in regions where the rays have non-zero helicity [8, 27]. This implies
that the mode conversion cannot be recast as an avoided crossing, and the standard
tools used for calculating the effect of the mode conversion cannot be used.
While we do not claim to have solved these novel mode conversion problems, the
new theoretical tools described in this dissertation may be useful for obtaining such
solutions. This is a topic for future research.

The second interesting observation arises when considering the "path integral"
that is obtained when calculating the exponent of a finite matrix. In this case, the
symbol is a function over a discrete set of points, and the path integral is naturally
interpreted as a sum over the space of probability distributions on this set of points.
This suggests calculating the sums by using a maximum entropy argument instead
of the integral along a path. If this interpretation can be extended to the continuous
case, it could provide a new perspective on the meaning and use of the path integral.
This measure-space interpretation will also provide a way to define and calculate
the Fourier transform of functions in infinite-dimensional spaces.



9.1 Previous Work on Path Integrals 



The use of path integrals in quantum mechanics was pioneered by Feynman [28],
though key ideas were anticipated by Wentzel [29] and Wiener [30]. The treatment of
classical wave equations, such as those in plasma physics, by path integral methods is
mathematically identical to the quantum case, with the classical limit corresponding
to the ray or WKB limit. The Feynman approach uses an integral over paths
in configuration space, and this configuration-space path integral has become a
standard formulation of quantum mechanics. Zee gives a nice introduction along
the following lines [31]. Suppose we want to find the probability amplitude for
transitioning from the state |q_I⟩ to the state |q_F⟩ in the time T. This is given by the matrix
element

\[ \langle q_F | e^{-iTH} | q_I \rangle. \tag{9.14} \]

By inserting N − 1 complete sets of states into this expression, we can write this
transition amplitude in terms of the transition amplitudes for short time steps
δt = T/N:

\[ \langle q_F | e^{-iTH} | q_I \rangle = \int \prod_{j=1}^{N-1} dq_j\; \langle q_F | e^{-i\delta t H} | q_{N-1} \rangle \langle q_{N-1} | e^{-i\delta t H} | q_{N-2} \rangle \cdots \langle q_1 | e^{-i\delta t H} | q_I \rangle. \tag{9.15} \]

Since the Hamiltonian will in general involve both position q and momentum p
operators, some care needs to be taken when calculating the jth term in this
integral. Evaluating the p terms in the p representation involves calculating a Gaussian
integral, and (for a standard kinetic plus potential Hamiltonian) results in

\[ \langle q_{j+1} | e^{-i\delta t H} | q_j \rangle = \left( \frac{-im}{2\pi\, \delta t} \right)^{1/2} \exp\left\{ i\, \delta t\, \frac{m}{2} \left( \frac{q_{j+1} - q_j}{\delta t} \right)^2 - i\, \delta t\, V(q_j) \right\}. \tag{9.16} \]

Inserting this back into the previous equation, and taking the limit N → ∞, gives
us the path integral in configuration space:

\[ \langle q_F | e^{-iTH} | q_I \rangle = \int \mathcal{D}q(t)\; \exp\left\{ i \int_0^T dt \left[ \tfrac{1}{2} m \dot{q}^2 - V(q) \right] \right\}. \tag{9.17} \]

While this path integral was explicitly constructed from the perspective of
configuration space, it is possible to form a path integral from the perspective of phase
space. Berezin [2] showed that the phase-space path integral arises whenever
calculating the Weyl symbol of a product of many operators. Calculating the symbol
of the operator e^{iTH} using the phase-space path integral then gives

\[ \int \exp\left\{ i \int_0^T \left[ \tfrac{1}{2} \left( q(t)\dot{p}(t) - p(t)\dot{q}(t) \right) - H(q(t), p(t)) \right] dt \right\} \mathcal{D}q(t)\, \mathcal{D}p(t). \tag{9.18} \]

This path integral is much more in the spirit of classical Hamiltonian mechanics,
since p and q are treated on an equal footing.

In this chapter we show how the configuration space and phase space path
integrals are related, using group theoretic ideas from Chapter 7. We start by
writing the path integral for a generic function of an operator, using the Zobin
symbol theory. The example of the symbol for the finite Heisenberg-Weyl group
is worked through in detail. We then outline how this path integral on the dual
group Ĝ_0 reduces to the phase-space path integral. This reduction is related to
the reduction to the primary representations. Further reduction to the irreducible
representations requires the selection of a "configuration space", and leads to the
configuration-space path integral. To our knowledge the connection between the
reduction of group representations (primary to irreducible) and the reduction of the
phase-space path integral to the configuration-space path integral is new.

9.2 Using Symbols to Calculate Functions of Operators

Calculating a function of an operator can be difficult in many cases. However,
the symbol of the operator can be used to find the symbol of the function of the
operator. This symbol of the function can then be converted back into an operator,
giving the desired function of the operator. In some cases, the symbol of the function
can be used to develop asymptotic approximations to the operator. It is of great
interest that many of these asymptotic approximations are "semiclassical" in flavor,
even when the problems being considered are not quantum mechanical.

For the exponential function, this procedure can be illustrated in the following
diagram.

\[ \begin{array}{ccc} H & \overset{\text{series definition}}{\dashrightarrow} & e^{itH} \\ Q^{-1} \downarrow & & \uparrow Q \\ \text{Symbol of } H & \xrightarrow{\ \text{path integral}\ } & \text{Symbol of } e^{itH} \end{array} \tag{9.19} \]

Instead of trying to go directly from the operator to the exponential (following
the dashed arrow), we first go down to its symbol using the inverse
quantization mapping, Q^{-1}. As we will show, the symbol of the exponent can be
calculated from the symbol of the operator, by using a path integral. The result of
the path integral is another function, which we can quantize using Q to obtain the
operator exp(itH).

In order to find the symbol of exp(itH) from the symbol of H, we need to define
what we mean by the exponential function. The ordinary exponential of a number
can be defined using the limit

\[ e^{x} = \lim_{N \to \infty} \left( 1 + \frac{x}{N} \right)^{N}. \tag{9.20} \]

Since we know how to calculate products of operators, we could use this expression
to define the exponential of an operator:

\[ e^{itH} = \lim_{N \to \infty} \left( 1 + \frac{itH}{N} \right)^{N}. \tag{9.21} \]

The operator H commutes with itself, so it acts like a c-number in this formula.
For any fixed value of N, this is simply the Nth power of the operator 1 + itH/N.
Repeated application of the star product can be used to calculate the symbol of this
power from the symbol of the operator:

\[ Q^{-1}\left[ \left( 1 + \frac{itH}{N} \right)^{N} \right](\tau) = Q^{-1}\Bigg[ \underbrace{\left( 1 + \frac{itH}{N} \right) \left( 1 + \frac{itH}{N} \right) \cdots \left( 1 + \frac{itH}{N} \right)}_{N \text{ products of operators}} \Bigg](\tau) \tag{9.22} \]

\[ = \underbrace{Q^{-1}\left[ 1 + \frac{itH}{N} \right](\tau) \star \cdots \star Q^{-1}\left[ 1 + \frac{itH}{N} \right](\tau)}_{N \text{ star products}} \tag{9.23} \]

\[ = \left( Q^{-1}\left[ 1 + \frac{itH}{N} \right] \right)^{\star N}(\tau) \tag{9.24} \]

This defines the ★ exponent, ★N. Taking the large N limit of this expression gives
us a way to calculate the symbol of the exponential of the operator.

\[ Q^{-1}\left[ e^{itH} \right](\tau) = \lim_{N \to \infty} \left( Q^{-1}\left[ 1 + \frac{itH}{N} \right] \right)^{\star N}(\tau) \tag{9.25} \]

As we will show, evaluating this expression in the limit leads to a path integral
on the space of τ's. Since τ is an irreducible representation of the commutative
group G_0, this will be an integral over paths in Ĝ_0. While the path integral result
is fairly generic for symbols defined for continuous groups, the limiting form of this
expression looks quite different for discrete groups. In the discrete case there is no
way to define continuous paths in the space Ĝ_0, and we get a sum over densities
instead of a path integral.

Instead of computing the limiting form of Equation (9.25) for a generic group,
we will give two examples. In the following sections we will describe how to calculate
the exponent of a matrix using the repeated star product. This will illustrate the
sum over densities. Then we will show how the path integral arises for continuous
groups, by considering the example of the Heisenberg-Weyl group.
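The limit definition of the operator exponential in Equation (9.21) can be checked numerically for a small matrix. The sketch below is only an illustration at the operator level (it does not use symbols or star products): it takes the 2 × 2 matrix H with zeros on the diagonal and ones off the diagonal, chosen here because exp(itH) = cos(t) 1 + i sin(t) H is known in closed form, and evaluates (1 + itH/N)^N by repeated squaring with N a power of two. The choice of H, t, and N is an assumption of the example.

```python
import math

def matmul2(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][m] * b[m][k] for m in range(2)) for k in range(2)]
            for i in range(2)]

def limit_exponential(h, t, doublings):
    """Approximate exp(i t H) by (1 + i t H / N)^N with N = 2**doublings."""
    big_n = 2 ** doublings
    m = [[(1 if i == k else 0) + 1j * t * h[i][k] / big_n for k in range(2)]
         for i in range(2)]
    for _ in range(doublings):      # squaring `doublings` times raises m to the power N
        m = matmul2(m, m)
    return m

def max_error(a, b):
    return max(abs(a[i][k] - b[i][k]) for i in range(2) for k in range(2))

H = [[0, 1], [1, 0]]
t = 1.0
exact = [[math.cos(t), 1j * math.sin(t)],
         [1j * math.sin(t), math.cos(t)]]

coarse_error = max_error(limit_exponential(H, t, 4), exact)    # N = 16
fine_error = max_error(limit_exponential(H, t, 20), exact)     # N = 1048576
```

The error falls off roughly like t²/(2N), which is why the symbol calculation that follows needs the limit of many repeated star products rather than a fixed finite power.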

9.3 The "Path Integral" for a Discrete Group 

The "path integral" for the discrete Heisenberg-Weyl group ℌ_n is obtained
by evaluating the expression in Equation (9.25) using the star product defined in
Section 8.6.7. Using the kernel K_★(τ, τ_1, τ_2) from Equation (8.108), we recall that
the star product can always be written as a tensor contraction.

\[ [A \star B](\tau) = \sum_{\tau_1, \tau_2} K_\star(\tau, \tau_1, \tau_2)\, A(\tau_1)\, B(\tau_2) \tag{9.26} \]

The symbol of powers of a matrix can also be computed using this kernel. If A(τ)
is the symbol of the section into which our matrix of interest was embedded, then
the appropriate symbol for the kth power of the matrix can be derived by induction
from repeated application of the star product.

\[ [A \star A \star A](\tau) = \sum_{\tau_1, \tau_2} K_\star(\tau, \tau_1, \tau_2)\, A(\tau_1)\, [A \star A](\tau_2) \tag{9.27} \]

\[ = \sum_{\tau_1, \tau_2} \sum_{\tau_3, \tau_4} K_\star(\tau, \tau_1, \tau_2)\, A(\tau_1)\, K_\star(\tau_2, \tau_3, \tau_4)\, A(\tau_3)\, A(\tau_4) \tag{9.28} \]

From this expression it is fairly evident what the Nth power would give. In order to
simplify the expressions, define a new function

\[ M_A(\tau_1, \tau_2) = \sum_{\tau'} K_\star(\tau_1, \tau', \tau_2)\, A(\tau'). \tag{9.29} \]

If we consider A(τ) as a vector indexed by τ, and K_★ as a rank three tensor, then
M_A is a matrix with elements

\[ [M_A]_{\tau_1, \tau_2} = \sum_{\tau'} [K_\star]_{\tau_1, \tau', \tau_2}\, [A]_{\tau'}. \tag{9.30} \]

This lets us write the ★-powers A^{★N}(τ) in terms of powers of the matrix M_A. For
example,

\[ [A \star A \star A](\tau) = \sum_{\tau_1, \tau_2} M_A(\tau, \tau_1)\, M_A(\tau_1, \tau_2)\, A(\tau_2) \tag{9.31} \]

\[ = \left[ M_A^2 \cdot A \right]_\tau \tag{9.32} \]

Extending this to the Nth power is straightforward, and gives

\[ A^{\star N}(\tau) = \big[ \underbrace{A \star A \star \cdots \star A}_{N \text{ star products}} \big](\tau) \tag{9.33} \]

\[ = \left[ M_A^{N-1} \cdot A \right]_\tau. \tag{9.34} \]

As with any matrix multiplication, this can be written in terms of the elements of
the matrices. Each multiplication requires the sum over the inner indexes, and so
N − 1 multiplications will give sums over N − 1 indexes,

\[ A^{\star N}(\tau) = \sum_{\tau_1} \sum_{\tau_2} \cdots \sum_{\tau_{N-1}} [M_A]_{\tau, \tau_1}\, [M_A]_{\tau_1, \tau_2} \cdots [M_A]_{\tau_{N-2}, \tau_{N-1}}\, [A]_{\tau_{N-1}}. \tag{9.35} \]

Note that since this is a power of a matrix, it will become dominated by the largest
eigenvalue of M_A in the limit of large N. This gives an alternative approach to the
calculation of the powers of the matrix, but we have not pursued this approach.
This form of the matrix multiplication is completely general. Because each of the
variables τ_j will take on all possible values, this means that all possible "paths"
of length N will appear in this summation. It is worthwhile to take a moment to
describe the notation that will be used for these paths. Because we are working
with a finite group, there are finitely many irreducible representations. The set of
irreducible representations is Ĝ_0, and the number of irreducible representations is
N_{Ĝ_0}. This set of representations Ĝ_0 can be thought of as a discrete set of points,
or a lattice of points. Give each point of Ĝ_0 a label, denoted by ℓ. Then the set Ĝ_0
looks like a list of points,

\[ \hat{G}_0 = \{ \ell_1, \ell_2, \ldots, \ell_{N_{\hat{G}_0}} \}. \tag{9.36} \]

In the sum over possible representations, the variable τ_j will take on all N_{Ĝ_0} possible
values:

\[ \tau_j \in \{ \ell_1, \ell_2, \ldots, \ell_{N_{\hat{G}_0}} \}. \tag{9.37} \]

A path is then a choice of points ℓ, one for each variable τ. We will sometimes use
the notation {τ} to indicate a path:

\[ \{\tau\} = \{ \tau_1, \tau_2, \ldots, \tau_N \}. \tag{9.38} \]

Care must be taken so as not to confuse the subscripts. The subscript of the τ
variables runs from 1 to N, the length of the path, and it indicates how far along
the path we are. The subscript on the ℓ's runs from 1 to N_{Ĝ_0}, and indicates which
point in the lattice Ĝ_0 we are discussing. Figure 9.1 gives a picture of what we mean
by this notation. In that example, the "path" is an ordered list of three elements
from Ĝ_0, {τ} = (τ_1, τ_2, τ_3).

FIG. 9.1: Example of a discrete path in a lattice of points. The path of three elements
is given by (τ_1, τ_2, τ_3).
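The equivalence between the matrix-power form of the ★-power and the explicit sum over discrete paths can be illustrated with a toy kernel. In the sketch below the kernel is not the Heisenberg-Weyl kernel of Equation (8.108); as a stand-in it uses cyclic convolution on three points, K[t][t1][t2] = 1 when t = t1 + t2 (mod 3), so that the ★-powers can be checked by hand as powers of a polynomial modulo x³ − 1. The function names and the choice of kernel are assumptions of the example.

```python
from itertools import product

def star_matrix(kernel, a):
    """M_A[t1][t2] = sum over t' of K[t1][t'][t2] * A[t'], as in Eq. (9.30)."""
    size = len(a)
    return [[sum(kernel[t1][tp][t2] * a[tp] for tp in range(size))
             for t2 in range(size)] for t1 in range(size)]

def star_power_matrix(kernel, a, n_factors):
    """A^{*N} computed as M_A^(N-1) applied to the vector A, as in Eq. (9.34)."""
    size = len(a)
    m = star_matrix(kernel, a)
    vec = list(a)
    for _ in range(n_factors - 1):
        vec = [sum(m[t][s] * vec[s] for s in range(size)) for t in range(size)]
    return vec

def star_power_paths(kernel, a, n_factors):
    """The same quantity as an explicit sum over all index paths (t_1, ..., t_{N-1})."""
    size = len(a)
    m = star_matrix(kernel, a)
    result = []
    for t in range(size):
        total = 0
        for path in product(range(size), repeat=n_factors - 1):
            weight, previous = 1, t
            for step in path:               # product of M_A elements along the path
                weight *= m[previous][step]
                previous = step
            total += weight * a[previous]
        result.append(total)
    return result

# Toy kernel (assumption): cyclic convolution on Z/3.
size = 3
K = [[[1 if t == (t1 + t2) % size else 0 for t2 in range(size)]
      for t1 in range(size)] for t in range(size)]
A = [1, 1, 0]                               # coefficients of 1 + x

cube_by_matrix = star_power_matrix(K, A, 3)
cube_by_paths = star_power_paths(K, A, 3)
number_of_paths = len(list(product(range(size), repeat=2)))
```

For this toy kernel the third ★-power is (1 + x)³ reduced modulo x³ − 1, and both routes give the same coefficients; the path version makes explicit that every ordered choice of intermediate labels contributes one term to the sum.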

From this point, the derivation of the path integral for the particular case of
the discrete Heisenberg-Weyl group will proceed in three steps. First, we will use
the structure of the group to find an explicit expression for the matrix M_A. Second,
this expression will be applied k − 1 times in succession, in order to evaluate the kth
star product formula above. This will give an expression that can be interpreted
either as a sum over "paths", or as a sum over measures. The third step is to apply
this sum over paths (or measures) to the limit formula in Equation (9.25) in order
to find the symbol of the exponent.



9.3.1 An expression for M_A for the Heisenberg-Weyl group

We are now considering the case of the star product for the Heisenberg-Weyl
group ℌ_n, which, as a set, is formed of n³ points. Using the definitions of the matrix
M_A [Equation (9.29)] and the kernel of the star product [Equation (8.108)], we have
the following (up to normalization),

\[ M_A(\tau_1, \tau_2) = \sum_{\tau'} \sum_{h_1} \sum_{h_2} \tau_1(h_1 \circ h_2)\, \tau'(-h_1)\, \tau_2(-h_2)\, A(\tau'). \tag{9.39} \]

This is an n³ × n³ matrix. Since we are interested in calculating with symbols instead
of functions on the group, we will rewrite this expression as sums over τ's instead
of sums over group elements h. Start by grouping the τ' terms:

\[ M_A(\tau_1, \tau_2) = \sum_{h_1} \left[ \sum_{h_2} \tau_1(h_1 \circ h_2)\, \tau_2(-h_2) \right] \sum_{\tau'} \tau'(-h_1)\, A(\tau') \tag{9.40} \]

\[ = \sum_{h_1} \left[ \sum_{h_2} \tau_1(h_1 \circ h_2)\, \tau_2(-h_2) \right] \tilde{A}(h_1), \tag{9.41} \]

where Ã is the (commutative) inverse Fourier transform of A. This expression is
now a sum over the group, of a product of two functions on the group. We can use
the Plancherel identity to write this as a sum of products of the Fourier transforms
of the two functions. First give a name to the sum in the square brackets:

\[ \mathcal{M}(h_1; \tau_1, \tau_2) = \sum_{h_2} \tau_1(h_1 \circ h_2)\, \tau_2(-h_2). \tag{9.42} \]

The Plancharel identity gives us this new expression: 



M^(ri,r2) = 5^!m(r;ri,r2)A(r). 



(9.43) 



We now only have to find DJI{t; Ti, T2), which is the (commutative) Fourier transform 
of Tl{hi; Ti, T2), with respect to the variable hi. In order to reduce subscripts, define 
g = hi. The Fourier transform is then 



9^(r; Ti, T2) = -^(9) Y^TiigO h2)T2{-h2). 



(9.44) 



g h2 

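In order to continue, use the following notation for the representations:

$$\tau(g) = \exp\Big[\frac{2\pi i}{n}\big(\tilde z\cdot z+\tilde\lambda\lambda\big)\Big]. \tag{9.45}$$

Each $\tau$ is labeled by an element of the dual, $(\tilde z,\tilde\lambda)\in\mathbb{Z}_n^3$. Also, we will write the
symplectic product as either a two-form $\omega(z_1,z_2)$, or in matrix notation

$$z_1^T\cdot J\cdot z_2. \tag{9.46}$$

These, together with the group product law, imply the following:

$$\mathfrak{M}(\tau;\tau_1,\tau_2) = \sum_{z,\lambda}\sum_{z_2,\lambda_2}\exp\Big\{\frac{2\pi i}{n}\big(\tilde z\cdot z+\tilde\lambda\lambda+\tilde z_1\cdot(z+z_2)\big)\Big\}$$
$$\qquad\times\exp\Big\{\frac{2\pi i}{n}\big(\tilde\lambda_1(\lambda+\lambda_2+z^T\cdot J\cdot z_2)-\tilde z_2\cdot z_2-\tilde\lambda_2\lambda_2\big)\Big\} \tag{9.47}$$
$$= \sum_{z,z_2}\exp\Big\{\frac{2\pi i}{n}\big[z\cdot(\tilde z+\tilde z_1)+z_2\cdot(\tilde z_1-\tilde z_2)+\tilde\lambda_1 z^T\cdot J\cdot z_2\big]\Big\}$$
$$\qquad\times\sum_{\lambda,\lambda_2}\exp\Big\{\frac{2\pi i}{n}\big(\lambda(\tilde\lambda+\tilde\lambda_1)+\lambda_2(\tilde\lambda_1-\tilde\lambda_2)\big)\Big\}. \tag{9.48}$$

The sums over $\lambda$ and $\lambda_2$ give us two Kronecker delta functions, and the sum over $z$
gives us another[1]:

$$\mathfrak{M}(\tau;\tau_1,\tau_2) = \delta(\tilde\lambda+\tilde\lambda_1)\,\delta(\tilde\lambda_1-\tilde\lambda_2)\sum_{z_2}\delta(\tilde z+\tilde z_1+\tilde\lambda_1 J\cdot z_2)\exp\Big\{\frac{2\pi i}{n}(\tilde z_1-\tilde z_2)\cdot z_2\Big\} \tag{9.49}$$
$$= \delta(\tilde\lambda+\tilde\lambda_1)\,\delta(\tilde\lambda_1-\tilde\lambda_2)\exp\Big\{\frac{2\pi i}{n}\Big(-\frac{1}{\tilde\lambda_1}(\tilde z_1-\tilde z_2)\cdot J^{-1}\cdot(\tilde z+\tilde z_1)\Big)\Big\} \tag{9.50}$$
$$= \delta(\tilde\lambda+\tilde\lambda_1)\,\delta(\tilde\lambda_1-\tilde\lambda_2)\exp\Big\{\frac{2\pi i}{n}\,\frac{\omega(\tilde z_1-\tilde z_2,\tilde z+\tilde z_1)}{\tilde\lambda_1}\Big\}. \tag{9.51}$$

Now put this back into the equation for $M_A$:

$$M_A(\tau_1,\tau_2) = \sum_{\tilde z,\tilde\lambda}A(\tilde z,\tilde\lambda)\,\delta(\tilde\lambda+\tilde\lambda_1)\,\delta(\tilde\lambda_1-\tilde\lambda_2)\exp\Big\{\frac{2\pi i}{n}\,\frac{\omega(\tilde z_1-\tilde z_2,\tilde z+\tilde z_1)}{\tilde\lambda_1}\Big\}. \tag{9.52}$$

This is the expression that we wanted to compute. We have written $M_A$ without
the sums over the group which we had previously used.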
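The Kronecker deltas in this derivation come from the orthogonality of the characters of $\mathbb{Z}_n$. The following is a minimal numerical sketch of that step only; the names `char_sum` and `lambda_sum` and the choice $n = 5$ are illustrative assumptions, not notation from the text.

```python
import cmath

def char_sum(k: int, n: int) -> complex:
    """Sum over lam in Z_n of exp(2*pi*i*lam*k/n)."""
    return sum(cmath.exp(2j * cmath.pi * lam * k / n) for lam in range(n))

def lambda_sum(lt: int, l1: int, l2: int, n: int) -> complex:
    """Double sum over (lam, lam2) of the lambda-dependent phase:
    exp(2*pi*i/n * (lam*(lt + l1) + lam2*(l1 - l2)))."""
    return sum(
        cmath.exp(2j * cmath.pi * (lam * (lt + l1) + lam2 * (l1 - l2)) / n)
        for lam in range(n)
        for lam2 in range(n)
    )

n = 5
# The single sum is n when k = 0 (mod n) and vanishes otherwise,
# i.e. it is n times a Kronecker delta on Z_n.
delta = [round(abs(char_sum(k, n)) / n) for k in range(-n, n + 1)]
```

With these definitions, `lambda_sum(lt, l1, l2, n)` is nonzero only when `lt + l1` and `l1 - l2` both vanish modulo `n`, which is the pair of delta functions carried through the equations above (up to the normalization factor $n^2$).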

[1] In the limit of large $n$, the discrete group goes over to the continuous Heisenberg-Weyl group,
and these Kronecker delta functions go over to Dirac delta functions.

Reductions of the matrix product

In Section 8.5.2 we saw that a particular choice of section would give a symbol
with a delta-like behavior in the $\tilde\lambda$ direction. We can follow up on that observation
by considering functions $A(\tilde z,\tilde\lambda)$ which have certain properties. For example, a
delta-like behavior could be replicated here, by assuming that

$$A(\tilde z,\tilde\lambda) = A(\tilde z)\,\delta(\tilde\lambda-1). \tag{9.53}$$

Inserting this into the equation for $M_A$ gives us a simplified formula:

$$M_A(\tau_1,\tau_2) = \delta(1+\tilde\lambda_1)\,\delta(1+\tilde\lambda_2)\sum_{\tilde z}A(\tilde z)\exp\Big(-\frac{2\pi i}{n}\,\omega(\tilde z_1-\tilde z_2,\tilde z+\tilde z_1)\Big). \tag{9.54}$$

While $M_A$ is an $n^3\times n^3$ matrix, the form for $A$ given in Equation (9.53) has reduced
$M_A$ essentially to an $n^2\times n^2$ matrix:

$$M_A(\tau_1,\tau_2) = \delta(1+\tilde\lambda_1)\,\delta(1+\tilde\lambda_2)\,M_A(\tilde z_1,\tilde z_2), \tag{9.55}$$

where the $n^2\times n^2$ matrix $M_A(\tilde z_1,\tilde z_2)$ is defined as

$$M_A(\tilde z_1,\tilde z_2) = \sum_{\tilde z}A(\tilde z)\exp\Big(\frac{2\pi i}{n}\,\omega(\tilde z+\tilde z_1,\tilde z_1-\tilde z_2)\Big). \tag{9.56}$$

This reduction is reminiscent of the reduction of the regular representation to the
primaries, which also left us with functions only on phase space.

This suggests that similar reductions are possible, by considering the reduction
of the primary representations to the irreducible representations. Since the
regular representation and its reductions act on functions on the group, we need to
perform an inverse Fourier transform of $A$ back onto the group. This gives

$$\check{A}(g) = \sum_{\tilde z}\sum_{\tilde\lambda}e^{-\frac{2\pi i}{n}\tilde z\cdot z}\,e^{-\frac{2\pi i}{n}\tilde\lambda\lambda}\,A(\tilde z)\,\delta(\tilde\lambda-1) \tag{9.57}$$
$$= e^{-\frac{2\pi i}{n}\lambda}\sum_{\tilde z}e^{-\frac{2\pi i}{n}\tilde z\cdot z}\,A(\tilde z) \tag{9.58}$$
$$= \check{A}(z)\,e^{-\frac{2\pi i}{n}\lambda}. \tag{9.59}$$

This is a covariant function on the group, as described in Equation (7.36). This
function lives in the invariant subspace associated with the primary representation.
We could instead consider functions which live in the subspace associated
with an irreducible representation. For example, we could consider functions in the
subspace given in Equation (7.115). Such a function can be expanded on the basis
functions given in Equation (7.123):

$$\check{A}(g) = \sum_q a_q\,\phi_q(g). \tag{9.60}$$

We can now insert the expression for $\phi_q(g)$, and perform a Fourier transform to get
back to a function of $\tau$. This is done as follows:

,0,0)o(0,p,A) (fi') (9.61) 

9 q p X 

= E E E Se-"^ V((g, p, A + qp)) (9.62) 

q p X 

q P X ^ ^ ' 

The sum over $\lambda$ will give a delta function in $\tilde\lambda$ as we had before, while the sum over
$p$ will give a delta function which can be used to evaluate the sum over $q$. Thus we
obtain

A{t) = 6{X - l)^agexp('^qqj exp ('^ {pp + qp)j (9.64) 

q ^ ' p ^ ^ 

= 5(A- l)^agexp(— ggj 5(]5 + g) (9.65) 
q ^ ^ 

= 6(\ — 1) exp I qp I a_p . (9.66) 

\ n J 

If we think of the coefficients as a function of $\tilde p$, then we can write $A(\tau)$ in a
way analogous to Equation (9.53):

$$A(\tau) = A(\tilde p)\,\delta(\tilde\lambda-1)\exp\Big(\frac{2\pi i}{n}\,\tilde q\tilde p\Big). \tag{9.67}$$

We can now see if the matrix $M_A$ will reduce any further, given this form of $A(\tau)$.
Inserting this expression for $A(\tau)$ into Equation (9.52) gives

Ma{ti,T2) = 6{1 + Ai)(5(l + A2) ^v4(p) exp(^^qp^ exp(^-'^uj{zi - Z2,z- zi)^ 

(9.68) 

= 5(1 + Ai)5(l + A2)e'^-(^^'-) Yl ^(P) (^^^P - ^^^1 - ^))) • 

(9.69) 

Writing out the symplectic product lets us gather terms which involve q, and sum 
over them. This gives us a delta function which can be used to evaluate the sum 
over p: 

Ma{ti, T2) = 6{1 + Ai)5(l + A2)e^"(^^'^^) 

(27ri \ 
—{qi-q2){pi-p2))j (9.70) 

= 5(1 + Ai)5(l + A2)exp( — -(u;(ii,i2) + (gi - q2){Pi - P2)) 1 A{p2 - pi) 

(9.71) 

If not for the symplectic product in the phase, this expression would depend on $\tilde z_1$
and $\tilde z_2$ only through their difference. It is important to note that the $\tilde q$'s appear
only in the phases. When the powers of this matrix are computed, this means that
the sums over $\tilde q$'s which will appear can be evaluated, leaving only sums over $\tilde p$'s,
which will lead to a path integral only in $\tilde p$.



9.3.2 The power $M_A^{N-1}$ and the path integral on phase space

We can now insert the expression for $M_A$ from Equation (9.54) into Equation
(9.35), the formula for the repeated star product of a symbol. In the formula for
the $N$-th power of the symbol, we need the $(N-1)$-th power of the matrix $M_A$:

$$M_A^{N-1}(\tau_1,\tau_N) = \sum_{\tau_2}\cdots\sum_{\tau_{N-1}}M_A(\tau_1,\tau_2)\cdots M_A(\tau_{N-1},\tau_N) \tag{9.72}$$
$$= \delta(1+\tilde\lambda_1)\,\delta(1+\tilde\lambda_N)\sum_{\tilde z_2}\cdots\sum_{\tilde z_{N-1}}M_A(\tilde z_1,\tilde z_2)\cdots M_A(\tilde z_{N-1},\tilde z_N). \tag{9.73}$$

Because we are using the reduced form of the matrix $M_A$, these sums can be thought
of as a sum over paths, where a path is a length-$N$ sequence of points in the discrete
phase space $\mathbb{Z}_n^2$.

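Substituting Equation (9.56) into Equation (9.73), with the summation variable in the
$j$-th factor named $z_j$, gives

$$M_A^{N-1}(\tau_1,\tau_N) = \delta(1+\tilde\lambda_1)\,\delta(1+\tilde\lambda_N)\sum_{\tilde z_2}\cdots\sum_{\tilde z_{N-1}}\;\sum_{z_1}\cdots\sum_{z_{N-1}}A(z_1)\exp\Big[\frac{2\pi i}{n}\,\omega(z_1+\tilde z_1,\tilde z_1-\tilde z_2)\Big]\cdots$$
$$\qquad\times A(z_{N-1})\exp\Big[\frac{2\pi i}{n}\,\omega(z_{N-1}+\tilde z_{N-1},\tilde z_{N-1}-\tilde z_N)\Big] \tag{9.74}$$
$$= \delta(1+\tilde\lambda_1)\,\delta(1+\tilde\lambda_N)\sum_{\{\tilde z\}}\sum_{\{z\}}\prod_{j=1}^{N-1}A(z_j)\exp\Big[\frac{2\pi i}{n}\,\omega(z_j+\tilde z_j,\tilde z_j-\tilde z_{j+1})\Big]. \tag{9.75}$$

Here, we have grouped the sums into two sums over the paths defined as

$$\{\tilde z\} = (\tilde z_2,\tilde z_3,\ldots,\tilde z_{N-1}) \tag{9.76}$$
$$\{z\} = (z_1,z_2,z_3,\ldots,z_{N-1}). \tag{9.77}$$

This is now looking more like a "path integral", where we sum over a set of possible
paths, except that there seem to be two different paths, $\{\tilde z\}$ and $\{z\}$.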

It turns out that this expression can be written in terms of only one path, $\{z\}$.
In the process of summing over $\{\tilde z\}$, however, the symplectic term in the phase
becomes more complicated. Careful algebra and a convenient change of variables
simplify the resulting phase, but the details of the calculation are messy. The
variables $\{\tilde z\}$ provide a nice sort of bookkeeping, recording which of the $z$'s meet in
the symplectic phase, and with which sign.

We will first directly perform the summation over $\{\tilde z\}$, and then describe a new
way to evaluate the resulting path integral by interpreting it as a sum over
probability distributions of the points in phase space. We then return to the expression
above with the two sets of paths, and outline a way to evaluate the matrix $M_A^{N-1}$
in terms of a certain type of joint probability distribution of the points $z$ and $\tilde z$.
Because this approach uses two sets of paths, difficulties with the algebra in the
direct calculation can be avoided. This comes at the cost of having to introduce
the joint probabilities (as will be discussed later), but these can be dealt with using
standard tools from information theory.

Direct summation over the paths $\{\tilde z\}$

One way to proceed with the evaluation of the matrix $M_A^{N-1}$ is to sum over the
$\{\tilde z\}$ paths. The sums over each $\tilde z_j$ can be evaluated, starting with $j = N-1$. This
results in a delta function which takes care of one more of the sums. This
procedure can be continued, unraveling, as it were, the symplectic phase starting at
the end.








(9.79) 





ZN-j-i) (9.80)
This pattern of grouping of the variables $z_j$ suggests a change of variables which
may simplify the expressions obtained from this calculation. Define new variables



^3 



'^ = Y,{-ir-^Z^.m. J = l,...,iV-l. (9.81) 



m=l 



The procedure used above, with summing over one variable giving a delta function
for the next, is simplest to use if we assume that $N$ is odd. This means that we will
be left with one delta function at the end, which will relate $z'_{N-1}$ to the specified
parameters $\tilde z_1$ and $\tilde z_N$. Written in the new variables, the $(N-1)$-th power of the
matrix becomes

/ 2711 

M^~'^{zi,)^i,zn,\n) = 5(1 + Ai)(5(l + AAr)exp u{zn,Zi) 



n 



N-l . X 

{.'} i=l ^ ^ 



(9.82) 

This form of the $(N-1)$-th power of $M_A$ is valid when the symbol $A$ is delta-like in
the $\tilde\lambda$ component [Equation (9.53)]. If instead we are considering the case in
Equation (9.67), where the Fourier transform of the symbol is in the invariant subspace
associated with an irreducible representation, then we get a different form for $M_A^{N-1}$.
Specifically, the $q'$ dependence of $A(z')$ will only be in the phase. This means that the
sums over $q'$ can be evaluated, leaving only the sums over $p'$. These remaining sums
can be grouped into paths in "configuration" space (actually "momentum" space,
but we know these are just different choices of Lagrange subspace). So the difference
between the phase-space path integral and the configuration-space path integral (à la
Feynman) is that they arise from different reductions of the regular representation.
When considering the reduction to the primary representations, we are left with
functions on phase space, and the repeated star product leads to the phase-space
path integral. On the other hand, when considering the reduction to irreducible
representations, we must choose a Lagrange subspace in phase space. The repeated
star product then leads us to consider the path integral in the "configuration" space
defined by the Lagrange plane.

At this point, there are two ways to continue the calculation. The first way
(which is more appropriate for the continuous group) is to write this as a path
integral in phase space. This is the approach taken by previous authors, e.g., by
Berezin and Shubin in [21], and we will use this approach in the following section
when we discuss the path integral for the continuous Heisenberg-Weyl group. A
second, novel, way to proceed does not involve turning this sum into a path
integral. Instead, observe that for the discrete group we are working with, there are
only finitely many values which the symplectic phase can take on. Instead of
summing over all possible paths $\{z\}$, we could group all the paths which give the same
symplectic phase, and then sum over the possible values of the phase. The tricky
part of the calculation becomes trying to count all the ways to get the same phase.
Fortunately, maximum entropy arguments can be used to obtain expressions for
the multiplicities of each phase. Once these multiplicity functions are calculated
(which is work for the future), the remaining sum becomes simpler. This connection
between path integrals and concepts from statistical mechanics is, we believe, new.
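The regrouping described here can be checked by brute force on a very small lattice. The sketch below is only an illustration under stated assumptions: the sign convention of `omega`, the tiny values of `n` and `N`, and the simplified phase $\sum_j \omega(z_j, z_{j+1})$ (rather than the full phase of the text) are all choices made for the example.

```python
from collections import Counter
from itertools import product
import cmath

n = 3          # modulus: phase space is Z_n x Z_n (n*n points)
N = 4          # number of points on each path (kept tiny for brute force)
points = [(q, p) for q in range(n) for p in range(n)]

def omega(a, b):
    """Discrete symplectic product q_a*p_b - p_a*q_b (sign convention assumed)."""
    return a[0] * b[1] - a[1] * b[0]

def path_phase(path):
    """Accumulated phase sum_j omega(z_j, z_{j+1}), reduced mod n."""
    return sum(omega(path[j], path[j + 1]) for j in range(len(path) - 1)) % n

# Direct sum over all (n^2)^N paths.
direct = sum(cmath.exp(2j * cmath.pi * path_phase(path) / n)
             for path in product(points, repeat=N))

# Same sum, reorganised: count how many paths give each phase value,
# then sum over the (at most n) distinct phase values.
multiplicity = Counter(path_phase(path) for path in product(points, repeat=N))
grouped = sum(m * cmath.exp(2j * cmath.pi * s / n) for s, m in multiplicity.items())
```

Here the multiplicities are found by exhaustive enumeration, which is only feasible for toy sizes; the text proposes obtaining them from maximum entropy arguments instead.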



9.3.3 From the path integral to the sum over measures

We now proceed to calculating the repeated star product from a statistical point
of view. Instead of reducing the sum over two paths to a sum over only one path,
as was done in the previous section, we start directly with Equation (9.74). The
symplectic phase involves terms from both the path $\{z\}$ and from the path $\{\tilde z\}$:

$$\omega(z_j+\tilde z_j,\tilde z_j-\tilde z_{j+1}) = \omega(z_j,\tilde z_j-\tilde z_{j+1})-\omega(\tilde z_j,\tilde z_{j+1}). \tag{9.83}$$

Notice that, while the first term involves points from both paths, the second involves
terms from only one path. We will consider this second term separately, before
returning to the more complicated term which involves both paths.

Consider the term in the sum over paths in Equation (9.74) which involves the
symbol $A$. It is simply the product of the values of $A$, evaluated along a path:

$$\prod_j A(z_j) = A(z_1)A(z_2)\cdots A(z_j)\cdots A(z_{N-1}). \tag{9.84}$$
i 

Each of the variables $z_j$ represents a point in the discrete phase space $\mathbb{Z}_n^2$, and
therefore can take on only finitely many values. We can think of the points in phase
space as points in a lattice. If we label the lattice points by $l$'s, then each of the
variables $z_j$ will have some label as its value:

$$z_j\in\{l_1,l_2,\ldots,l_{n^2}\}. \tag{9.85}$$

We can also consider the path $\{z\}$ formed by the variables $z_j$:

$$\{z\} = (z_1,z_2,\ldots,z_j,\ldots,z_N). \tag{9.86}$$

If we think of the index $j$ as a sort of discrete "time", then the path is a mapping
from time to the points in phase space:

$$z:\{2,3,\ldots,N\}\to\{l_1,l_2,\ldots,l_{n^2}\}. \tag{9.87}$$

Similarly, a complex-valued function on phase space is simply a list of complex
numbers, one for each point $l$ in phase space:

$$A:\{l\}\to\mathbb{C} \tag{9.88}$$
$$A(l) = A_l. \tag{9.89}$$

The phase space point label $l$ is used as a subscript for convenience. When written
this way, the discrete nature of phase space is emphasized, and $A$ can be thought
of as a length-$n^2$ vector, since there are $n^2$ points in phase space. Now specialize
to the case where $A$ is a term from the limit formula for the exponential [Equation (9.25)]:

$$A(z) = \Big(1+\frac{it}{N}H(z)\Big)\approx e^{itH(z)/N}. \tag{9.90}$$

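We can now rewrite the product over the path as a weighted sum over the points of
phase space:

$$\prod_{j=1}^{N-1}A(z_j)\approx\exp\Big[\frac{it}{N}\sum_{j=1}^{N-1}H(z_j)\Big] \tag{9.91}$$
$$= \exp\Big[\frac{it}{N}\sum_{l\in\mathbb{Z}_n^2}N_l H_l\Big]. \tag{9.92}$$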

The incidence function $N_l$ simply counts how many times the variables $z_j$ take on the
value $l$. Another way to think of $N_l$ is in terms of the path $\{z\}$. In this context, $N_l$
is the number of times that someone following the path $\{z\}$ would visit the location
$l$, where $H(z)$ takes the value $H_l$. Since the incidence function counts steps along a
path, the sum over all values of $l$ must be equal to the length of the path:

$$\sum_l N_l = N. \tag{9.93}$$

Once the path is specified, the incidence $N_l$ is completely determined. This change
in the point of view allows us to exchange the product of the $A(z_j)$ accumulated along
the path for a weighted sum of the $H_l$ over phase space. Since the incidence $N_l$ is the
weight in the sum, we can see that it is playing the role of a measure on phase space. The
product on the given path has been turned into a sum with a given measure. With
proper normalization, this measure can be treated as a probability distribution on
phase space.
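As a concrete illustration of trading a product along a path for a sum weighted by the incidence function, consider the following sketch. The path, the values `H`, the time `t`, and the labeling of lattice points by a single integer are hypothetical choices for the example, and the exponential form of $A$ is used directly.

```python
from collections import Counter
import cmath

n = 3
N = 6
# A hypothetical path: N points of Z_n x Z_n, each labelled by an integer
# l in range(n*n).
path = [0, 4, 4, 7, 0, 4]
# A hypothetical "hamiltonian" H_l on the n^2 lattice points, and a time t.
H = [0.3 * l for l in range(n * n)]
t = 1.0

incidence = Counter(path)   # N_l = number of visits to lattice point l

# Product of A(z_j) = exp(i t H(z_j) / N) accumulated along the path ...
along_path = 1.0 + 0j
for l in path:
    along_path *= cmath.exp(1j * t * H[l] / N)

# ... equals the exponential of a sum over phase space weighted by N_l.
weighted = cmath.exp(1j * t / N * sum(N_l * H[l] for l, N_l in incidence.items()))
```

Dividing each `incidence[l]` by `N` gives a normalized weight, which is the sense in which the incidence function behaves like a probability distribution on the lattice.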

The phase term in the sum over paths can be manipulated in a similar way.
Start by considering only the term which involves the pair of points $(\tilde z_j,\tilde z_{j+1})$:

$$T(\{\tilde z\}) = \exp\Big(-\frac{2\pi i}{n}\sum_{j=1}^{N-1}\omega(\tilde z_j,\tilde z_{j+1})\Big). \tag{9.94}$$

Since the symplectic product takes two points as input, the phase has a matrix of
possible values:

$$[\omega]_{l,m} = \omega(l,m),\qquad 0\le l,m<n^2. \tag{9.95}$$

We can now rewrite the phase, by converting the sum along the path into a sum
over possible values of $\omega$. Now instead of just counting how many times each point
appears in the path, we need to count how many times each possible pair of points
appears as neighbors in the path. This gives us a coincidence table, which we can
write as a matrix:

$$\mathsf{N}_{l,m} = \#\text{ times the pair }(l,m)\text{ appears in the path}. \tag{9.96}$$

For a path of length $N$, there are $N-1$ pairs of points. This implies that the sum
of all the elements in the coincidence table is

$$\sum_{l,m}\mathsf{N}_{l,m} = N-1. \tag{9.97}$$

l,m 

Also, this coincidence table is derived from a path $\{\tilde z\}$, which has a particular
incidence function $N_l$. The coincidence table must be consistent with $N_l$, and this
consistency constraint can be written in terms of the column and row sums of the
table $\mathsf{N}$:

$$N_l = \sum_m\mathsf{N}_{l,m} = \sum_m\mathsf{N}_{m,l}. \tag{9.98}$$

With the matrices $\mathsf{N}$ and $\omega$ defined, the phase can now be written as a contraction
of the matrices:

$$\sum_{j=1}^{N-1}\omega(\tilde z_j,\tilde z_{j+1}) = \sum_{l,m}\mathsf{N}_{l,m}\,\omega_{l,m} = \mathsf{N}:\omega. \tag{9.99}$$

In order to simplify notation, the symbol ":" is used to denote the contraction of
the matrices. Note that the matrix $\omega$ is antisymmetric, $\omega^T = -\omega$. That means
that this contraction looks like an inner product:

$$\mathsf{N}:\omega = \mathrm{Tr}(\mathsf{N}\omega^T) = -\mathrm{Tr}(\mathsf{N}\omega) = \langle\omega,\mathsf{N}\rangle. \tag{9.100}$$

We can now rewrite $T(\{\tilde z\})$ in terms of the coincidence table $\mathsf{N}$, which is
calculated for the particular path $\{\tilde z\}$:

$$T(\{\tilde z\}) = \exp\Big(-\frac{2\pi i}{n}\,\mathsf{N}:\omega\Big). \tag{9.101}$$

This simplification of $T$ will help give us a way to organize the paths in the sum over
all paths in Equation (9.74), by grouping paths with the same coincidence table $\mathsf{N}$.
The double sum over paths and joint coincidence tables

We now turn to the calculation of the phase which involves both paths, $\{z\}$
and $\{\tilde z\}$. The phase which involved only one path was written in terms of the
two-point coincidence table $\mathsf{N}_{l,m}$, which counted how many times a particular pair
of points appeared in one path. Now we have to count how many times a triple of
points, one from $\{z\}$ and a neighboring pair from $\{\tilde z\}$, appears in the two paths.
Introduce a triple-coincidence table, $\mathsf{N}_{l,m_1,m_2}$, which counts how many times the
point $l$ appears along with the pair $(m_1,m_2)$. This triple-coincidence table must agree
with the coincidence table $\mathsf{N}_{m_1,m_2}$ for the path $\{\tilde z\}$
and the incidence function $N_l$ for the path $\{z\}$:

$$\sum_l\mathsf{N}_{l,m_1,m_2} = \mathsf{N}_{m_1,m_2}\quad\text{and}\quad\sum_{m_1,m_2}\mathsf{N}_{l,m_1,m_2} = N_l. \tag{9.102}$$

In order to compute the phase from this triple-coincidence table, we need to be
able to combine it with the matrix $\omega$ properly. The two terms in the symplectic
product can be calculated from two different contractions of $\omega$ with $\mathsf{N}_{l,m_1,m_2}$. These
contractions are

Uj{Zj,Zj - Zj+i) ^U;i^rn^'^l,rm,m-mi = W : N (9.103) 

l,m mi 

and 

u{Zj,Zj+i) '^UJl,rn'^'Nl,rm,m2 = w : N. (9.104) 

l,m I 

Here we have defined a new coincidence table N as the sum 

N;,„ = ^N,,^,,„,. (9.105) 

nil 

Note that since this is not the coincidence table for only one path, it does not need 
to satisfy the same marginal sums as the ordinary coincidence table for one path. 
Specifically, the row and column sums give incidence functions for the two different 
paths: 

Ni = J2^i,m, (9.106) 

m 

and 

Nm = J2^i,rn. (9.107) 

I 

With these definitions, and the expressions in Equations (9.92) and (9.101), we
can rewrite Equation (9.74) as

Mt\n, tn) = 5{1 + Ai)5(l + A^) J2 (jr E ^'^0 

^ f^7W(N)exp (^cu : N-u; : Nm . (9.108) 



X 

V N 



The tilde on the inner sum indicates that it is a sum over all possible
triple-coincidence tables $\mathsf{N}$ which agree with the incidence functions $N_l$ and $\tilde N_m$ for the
two paths. We have introduced the multiplicity function $\mathcal{M}(\mathsf{N})$, which counts the
number of pairs of paths that share the same triple-coincidence table and hence give
the same phase. This multiplicity
function is necessary since there are generically many pairs of paths $\{z\}$ and $\{\tilde z\}$
which give the same $\mathsf{N}$. Since each triple-coincidence table can only be consistent
with one pair of incidence functions $N_l$ and $\tilde N_m$, the sum over $N_l$ and $\tilde N_m$ together
with the weighted sum over $\mathsf{N}$ gives the same result as the sum over all paths $\{z\}$
and $\{\tilde z\}$.

It is at this stage that we are ready to use some tools from statistical mechanics. 
We would like to use a maximum entropy calculation in order to evaluate the sum 
over coincidence tables. This can be done, and will give an expression for the sum in 
the limit of longer and longer paths. As the length of the path goes to infinity, we can 
use, for example, the generating function for digram frequencies given in Chapter 



22 of Jaynes [3^. This generating function can be used to calculate the probability 
of a pair of paths given that the paths have a particular triple-coincidence table,
$p(\{z\},\{\tilde z\}\,|\,\mathsf{N})$. From this probability distribution in "path space" we can calculate
the multiplicity function $\mathcal{M}(\mathsf{N})$. We are most interested in finding those pairs of
paths associated with the maximum of $\mathcal{M}(\mathsf{N})$. Such a calculation is the subject of
ongoing research, and is just one of the many new research directions based on the
group theory point of view developed in this dissertation.

9.4 The Path Integral for the Continuous Heisenberg-Weyl Group

The phase space path integral arises when the symbol of the exponential of an
operator is computed in the context of the Heisenberg-Weyl group [21]. As with the
calculation for the discrete Heisenberg-Weyl group in the previous section, we start
by computing the star product of a symbol with itself, repeated $N$ times, and then
apply this to Equation (9.21) to find the exponential.

In this section, the following notation will be used. An operator $A$ will be
embedded into a section $S_A$. Then the group Fourier transforms will be applied,
giving the symbol of $S_A$ and the commutative Fourier transform of the symbol.

SA{p)^~aig)'^^^' air) (9.109) 

Here $a(\tau)$ is the symbol of the section $S_A(\rho)$, and the function on the group, $\check{a}(g)$,
is the (commutative) inverse Fourier transform of $a(\tau)$ obtained at the intermediate
step in the calculation.

We now proceed to the $N$-fold star product calculation. This star product is
obtained by finding the symbol of an $N$-fold product of operators.

(ai*a2*...*a^)(r) = [Q-\Sa,Sa, ■ ■ ■ SAj,)]ir) (9.110) 

= [J=-^2„^ioJ^^\SA,SA,...SAMr) (9.111) 
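We start by using the convolution theorem to calculate the inverse transform $\mathcal{F}^{-1}_{\mathfrak{H}_n}$:

$$\big(\mathcal{F}^{-1}_{\mathfrak{H}_n}(S_{A_1}S_{A_2}\cdots S_{A_N})\big)(g) = \int\cdots\int\check{a}_1(g_1)\,\check{a}_2(g_2)\cdots\check{a}_N(g_N)\,\delta_g(g_1\circ g_2\circ\cdots\circ g_N)\prod_j dg_j. \tag{9.112}$$

This form of the convolution is nice, since it isolates $g$ and provides a delta function
which is useful for evaluating the next Fourier transform integral.

$$(a_1*\cdots*a_N)(\tau) = \int_{\mathbb{R}^{2n+1}}dg\,\tau(g)\int\cdots\int\check{a}_1(g_1)\cdots\check{a}_N(g_N)\,\delta_g(g_1\circ\cdots\circ g_N)\prod_j dg_j \tag{9.113}$$
$$= \int_{\mathbb{R}^{2n+1}}\cdots\int\check{a}_1(g_1)\cdots\check{a}_N(g_N)\,\tau(g_1\circ\cdots\circ g_N)\prod_j dg_j. \tag{9.114}$$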

We can now use our knowledge of the groups $\mathfrak{H}_n$ and $\mathbb{R}^{2n+1}$ to rewrite the term
involving $\tau$. As a set, these two groups are the same, and can be separated into a
$2n$-dimensional phase space component, $z$, and an additional 1-dimensional component,
$\lambda$. The dual can also be separated into components. Use the ~ notation to
signify an element of the dual. I.e., write an irreducible representation of $\mathbb{R}^{2n+1}$ as

$$\tau(g) = e^{i(\tilde z\cdot z+\tilde\lambda\lambda)}. \tag{9.115}$$

With this notation, and the form of the group product law for the Heisenberg-Weyl
group, we can now write the $\tau$ term from the above integral as

$$\tau(g_1\circ\cdots\circ g_N) = \exp\Big\{i\Big[\tilde z\cdot(z_1+\cdots+z_N)+\tilde\lambda\Big(\lambda_1+\cdots+\lambda_N+\tfrac{1}{2}\Omega(z_1,\ldots,z_N)\Big)\Big]\Big\} \tag{9.116}$$
$$= e^{i\tilde\lambda(\lambda_1+\cdots+\lambda_N)}\exp\Big\{i\tilde z\cdot(z_1+\cdots+z_N)+\frac{i\tilde\lambda}{2}\Omega(z_1,\ldots,z_N)\Big\}. \tag{9.117}$$

The symplectic products from $g_1\circ\cdots\circ g_N$ have been grouped together into one
term, which can be directly calculated, and is

$$\Omega(z_1,z_2,\ldots,z_N) = \omega(z_1,z_2)+\omega(z_1+z_2,z_3)+\cdots+\omega(z_1+z_2+\cdots+z_{N-1},z_N). \tag{9.118}$$
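The way the symplectic products accumulate under repeated group multiplication can be checked directly. The sketch below assumes a product law of the form $(z_1,\lambda_1)\circ(z_2,\lambda_2) = (z_1+z_2,\ \lambda_1+\lambda_2+c\,\omega(z_1,z_2))$ with $c = 1/2$, one degree of freedom, and a particular sign convention for `omega`; the function names and sample elements are hypothetical.

```python
from fractions import Fraction
from functools import reduce

def omega(a, b):
    """Symplectic product on R^2: q_a*p_b - p_a*q_b (sign convention assumed)."""
    return a[0] * b[1] - a[1] * b[0]

def compose(g, h, c=Fraction(1, 2)):
    """Assumed product law: (z1, l1) o (z2, l2) = (z1+z2, l1+l2+c*omega(z1,z2))."""
    (z1, l1), (z2, l2) = g, h
    return ((z1[0] + z2[0], z1[1] + z2[1]), l1 + l2 + c * omega(z1, z2))

def big_omega(zs):
    """Omega(z_1..z_N) = sum over k of omega(z_1 + ... + z_{k-1}, z_k)."""
    total, partial = 0, (0, 0)
    for z in zs:
        total += omega(partial, z)
        partial = (partial[0] + z[0], partial[1] + z[1])
    return total

# Four sample group elements (z, lambda) with integer entries.
elements = [((1, 2), 3), ((0, -1), 1), ((4, 1), -2), ((2, 2), 0)]
z_total, lam_total = reduce(compose, elements)
zs = [z for z, _ in elements]
lams = [lam for _, lam in elements]
```

Because $\omega$ is bilinear, the same accumulation holds for any constant $c$ in the product law; only the prefactor of $\Omega$ in the phase changes.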

From Equation (9.117), it looks like the variables $\lambda_j$ are separating out nicely. We
might therefore try to integrate over these variables in Equation (9.114). We can
do this if we assume that the functions $\check{a}_j(g_j)$ are separable. In a slight abuse of
notation, write these functions as products:

$$\check{a}_j(g_j) = b_j(\lambda_j)\,a_j(z_j). \tag{9.119}$$

This form is motivated by the calculation in Section 8.5.2, where it is shown that the
Weyl symbol is a special case of the Zobin symbol, which arises when the function
$\check{a}(g)$ satisfies exactly this sort of separability condition. (Notice that this condition is
similar to the covariance condition given in Equation (7.36), and used to decompose
the regular representation.)

Putting all of this together, and plugging it into Equation (9.114), gives us a
new expression:

$$(a_1*\cdots*a_N)(\tilde z,\tilde\lambda) = \int\cdots\int\prod_{j=1}^{N}a_j(z_j)\exp\Big\{i\tilde z\cdot\sum_j z_j+\frac{i\tilde\lambda}{2}\Omega(z_1,\ldots,z_N)\Big\}\prod_j dz_j$$
$$\qquad\times\int\cdots\int e^{i\tilde\lambda(\lambda_1+\cdots+\lambda_N)}\,b_1(\lambda_1)\cdots b_N(\lambda_N)\prod_j d\lambda_j. \tag{9.120}$$

The integrals over the $\lambda_j$'s are ordinary Fourier transforms. The difficult part of the
calculation arises when trying to simplify the $z_j$ integrals. Berezin and Shubin [21]
evaluate an integral such as this and obtain the phase space path integral, while
Ozorio de Almeida [25] interprets the phase $\Omega$ in terms of the area of polygons in
phase space. These approaches are similar to the calculation for the discrete group,
which was done in the previous section. The details of one approach (due to N.
Zobin) are given in an appendix and result in the expression (NB the notation has
been changed so it matches the notation used in this section):



N+l ., N \ N N 



Jexp I -y io{zk+i - Zfc, Zk - Zfe_i) + J^Y1 ) n n ^^J' 



k=2 j=l / j=l j=l 

(9.121) 

In the limit $N\to\infty$, the sums in the phase go over to integrals, and the integrals
over the $z_j$ become a path integral.

Alternatively, we could rewrite this path integral in terms of measures on phase
space, just as the discrete "path integral" was rewritten in the previous section.
This idea will be explored further in the next section.

9.5 The path integral as a Fourier transform on the space of measures

The group theoretical perspective that has been developed in this section leads
naturally to an interpretation of the path integral as a Fourier transform on the space
of measures. In order to motivate this interpretation, we will start by examining
the formula for the discrete "path integral" given in Equation (9.108). Consider the
phase space portion of that formula:

$$M_H^{N-1}(\tilde z_1,\tilde z_N) = \sum_{N_l}\exp\Big(\frac{it}{N}\sum_l N_l H_l\Big)\Big(\widetilde{\sum_{\mathsf{N}}}\,\mathcal{M}(\mathsf{N})\exp\Big(-\frac{2\pi i}{n}\,\mathsf{N}:\omega\Big)\Big). \tag{9.122}$$

Recall that the incidence function $N_l$ counted how many times a path of length $N$
(with endpoints $z_1$ and $z_N$) visited each point $l$ in phase space. We then converted
the sum over all such paths into a sum over all possible incidence functions $N_l$. Now
consider the limit $N\to\infty$. In this limit, the endpoints of the path should not
contribute to the sum, so let them be anything. Also, the term $N_l/N$ becomes a
probability density $p_l$, and the sum over $p_l$ goes over to an integral:

$$M_H = \int\exp\Big(it\sum_l p_l H_l\Big)\,d\mu(p_l). \tag{9.123}$$

Here we have defined the measure $d\mu(p_l)$ as

$$d\mu(p_l) = \lim_{N\to\infty}\frac{1}{(n^2)^N}\widetilde{\sum_{\mathsf{N}}}\,\mathcal{M}(\mathsf{N})\exp\Big(-\frac{2\pi i}{n}\,\mathsf{N}:\omega\Big). \tag{9.124}$$

This measure depends on $p_l$ through the constraint on the sum over coincidence
tables $\mathsf{N}$. The normalization factor $1/(n^2)^N$ has been introduced in order to keep
$d\mu(p_l)$ finite. The multiplicity $\mathcal{M}(\mathsf{N})$ is the number of paths with coincidence table
$\mathsf{N}$, and $(n^2)^N$ is the total number of paths of length $N$. The ratio then looks like a
probability on the space of probabilities $p_l$:

$$p(p_l) = \lim_{N\to\infty}\widetilde{\sum_{\mathsf{N}}}\,\frac{\mathcal{M}(\mathsf{N})}{(n^2)^N}. \tag{9.125}$$

This probability is nearly the measure $d\mu(p_l)$, except that it does not include the
phase. We are now set up to interpret $M_H$ as a Fourier transform.

First, notice that the phase in $M_H$ involves two functions of the discrete phase
space. There are $n^2$ points in phase space, so $H_l$ and $p_l$ can be thought of as vectors
of length $n^2$, i.e., they are points in an $n^2$-dimensional space. The density $\Phi(p_l)$ of
the measure, $d\mu(p_l) = \Phi(p_l)\,d^{n^2}p$, is also a function on $\mathbb{R}^{n^2}$:

$$\Phi:\mathbb{R}^{n^2}\to\mathbb{C}. \tag{9.126}$$

The (ordinary) Fourier transform of such a function is usually written

$$\hat\Phi(k_l) = \int e^{i\sum_l k_l p_l}\,\Phi(p_l)\,d^{n^2}p. \tag{9.127}$$

Comparing this to Equation (9.123) shows that $M_H$ is actually the Fourier transform
of $\Phi(p_l)$, evaluated at the point $tH_l$. Notice that this is a Fourier transform
with respect to the variable $p_l$, which can also be thought of as a probability density,
or measure, on the discrete phase space. So the path integral can be viewed as a
Fourier transform over the space of measures.
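The conversion of a sum over paths into a sum over incidence functions can be illustrated in the simplest setting, where the symplectic phase is dropped and only the weights $e^{itH_l/N}$ are kept. Under that simplifying assumption (which is not the full formula of the text) the multiplicity of an incidence function is just a multinomial coefficient. The values of `H`, `t`, and `N` and the helper names are hypothetical.

```python
import cmath
from itertools import product
from math import factorial

def compositions(total, parts):
    """Yield all tuples of `parts` non-negative integers summing to `total`."""
    if parts == 1:
        yield (total,)
        return
    for first in range(total + 1):
        for rest in compositions(total - first, parts - 1):
            yield (first,) + rest

def multinomial(counts):
    """Number of paths that visit lattice point l exactly counts[l] times."""
    result = factorial(sum(counts))
    for c in counts:
        result //= factorial(c)
    return result

H = [0.0, 0.7, -0.4, 1.1]   # hypothetical hamiltonian on 4 lattice points
t, N = 1.5, 5               # hypothetical time and path length

def weight(l):
    """Weight exp(i t H_l / N) attached to one visit of lattice point l."""
    return cmath.exp(1j * t * H[l] / N)

# Sum over all len(H)**N paths of the product of weights along the path.
over_paths = 0j
for path in product(range(len(H)), repeat=N):
    term = 1 + 0j
    for l in path:
        term *= weight(l)
    over_paths += term

# Same quantity as a sum over incidence functions N_l with sum_l N_l = N.
over_measures = sum(
    multinomial(counts)
    * cmath.exp(1j * t * sum(c * h for c, h in zip(counts, H)) / N)
    for counts in compositions(N, len(H))
)
```

The second sum has far fewer terms than the first, and each term depends on the path only through the normalized weights `counts[l] / N`, which play the role of $p_l$.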

This interpretation of the discrete path integral as a Fourier transform over
the space of measures extends to the continuous path integral. This can be seen
by letting the number of points in the discrete phase space become large. The
hamiltonian function $H_l$ will go over to the hamiltonian on the continuous phase
space, $H(z)$, and the probability $p_l$ will go over to a probability density, or measure,
on phase space. The dot product will then become an integral:

$$\sum_l p_l H_l\to\int H(z)\,p(z)\,dz = \langle H(z),p(z)\rangle. \tag{9.128}$$

This lets us write the path integral as an infinite-dimensional Fourier transform:

$$M[H(z)] = \int e^{it\langle H(z),p(z)\rangle}\,\Phi[p(z)]\,\mathcal{D}[p(z)]. \tag{9.129}$$

This can either be viewed as an alternative interpretation of the path integral, or as
the definition of the infinite-dimensional Fourier transform. This insight is due to
Zobin, and is a new way to understand the path integral.



9.6 Summary 

In this chapter, the group theoretical foundations of path integrals were de- 
scribed. Starting with the theory of symbols presented in Chapter [HI we considered 
the symbol of a function of an operator. In particular, the symbol of the exponential 
of an operator was calculated using the power series definition for the exponential 
function. This results in an expression which requires the repeated application of 
the star product for the calculation of the $N$-th order term in the series. In the
limit of large $N$, the star products in this series can be recast as a path integral.
While this star-product approach to the path integral has been considered
previously [21, 25, 33], we have shown that this approach is very general, as it is based
on the non-commutative Fourier transform for an (almost) arbitrary group.

As an example of the flexibility of this approach, we show how to calculate the
"path integral" for the finite Heisenberg-Weyl group. The resulting sum over paths
looks quite different from the ordinary path integral, since there are no continuous
"paths" on finite groups. By examining this finite example, we were able to show
how the path integral can be connected to ideas from statistical mechanics. The sum
over all paths was converted into a sum over probability distributions in phase space.
This opens up several avenues of future research, including the possibility of using
maximum entropy arguments to evaluate the sum over distributions. Bringing these
ideas back to the continuous path integral leads to the formulation of an
"infinite-dimensional" Fourier transform: the Fourier transform over the space of measures.

This group theoretical framework for understanding the path integral has also
shed new light on the connection between the phase-space path integral and the
configuration-space path integral. The generic path integral which arises from this 
theory relies heavily on the Fourier transform for groups, in both its commutative 
and non-commutative versions. The Fourier transform itself is related to the reduc- 
tion of the regular representation into irreducible representations, so it is natural 
to consider the effect of such reductions on the path integral. We show how the 
phase-space path integral arises when considering the reduction to the primary rep- 
resentation, since (for the Heisenberg-Weyl group) the primary representation acts 
on functions on phase space. The further reduction to the irreducible representa- 
tions requires choosing a subspace of phase space as the "configuration space" . This 
choice leads to a reduction of the path integral to an integral over paths in config- 
uration space. Thus, by recasting the symbol of an operator as a double Fourier 
transform for the Heisenberg-Weyl group, we have provided a new group theoretical 
framework for phase-space and configuration-space path integrals. 



CHAPTER 10 

A Brief Survey of Connections to Mode 
Conversion Theory 



This chapter presents a cursory discussion of several areas of investigation
suggested by the results of the previous chapters. It is not meant to be a
final report on finished research; rather, it is meant to convey the wealth of new
questions and ideas suggested by this work. It is our hope that the ideas presented
here convey the richness of this field and the variety of avenues of
research which are now open for further study.

There are three main ideas presented in this chapter. First, we will see that it 
might be useful to use the diagonals of the dispersion matrix as ray hamiltonians 
when using WKB methods for vector problems, rather than using the eigenvalues 
as is typically done. This requires, however, that the matrix operator be put into 
"normal form". The second idea is that it may be possible to combine the discrete 
and continuous Heisenberg-Weyl groups to form a "double symbol" for the operator- 
valued dispersion matrix which arises in vector wave problems. The third idea 
presented in this chapter relates to using the Wigner function to include the effects 
of turbulent fluctuations on mode conversions, by marginalizing the Wigner function 
over a system parameter whose value is uncertain.

10.1 Review of Vector WKB

All of the ideas presented in this chapter relate to vector wave problems.
Recall that such a problem can be written in matrix form: 

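$$
\begin{pmatrix} \hat{D}_{11} & \hat{D}_{12} \\ \hat{D}_{21} & \hat{D}_{22} \end{pmatrix}
\cdot
\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = 0. \qquad (10.1)
$$

This equation is written as a matrix of operators acting on the field $\psi(x)$. If we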
want to use the mathematics of symbols to write this operator as a function in 
phase space, we would usually calculate the symbol of each element of the matrix 
separately. This defines the dispersion matrix, which is a matrix-valued function on 
phase space: 



$$
\mathbf{D}(x,k) = \begin{pmatrix} D_{11}(x,k) & D_{12}(x,k) \\ D_{21}(x,k) & D_{22}(x,k) \end{pmatrix}. \qquad (10.2)
$$

This matrix is the starting point for the construction of the ray-tracing approxima- 
tion. First we recall how ray-tracing methods can be derived from the phase space 
path integral, in the case of scalar wave equations. 
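For a scalar wave equation, the path integral is:

$$
\int e^{\, i\int_0^T \left[\frac{1}{2}\left(\dot{q}(t)\,p(t) - \dot{p}(t)\,q(t)\right) - H(q(t),p(t))\right] dt}\; \mathcal{D}q(t)\,\mathcal{D}p(t). \qquad (10.3)
$$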

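Hamilton's equations for the rays can be derived from this path integral by evaluating
it using the stationary phase approximation. Write the phase in this integral as

$$
S[z(t)] = \int_0^T \left(\tfrac{1}{2}\, z\cdot J\cdot\dot{z} - H(z)\right) dt. \qquad (10.4)
$$

Variation of this with respect to the path, $z(t) \to z(t) + \epsilon\,\zeta(t)$, gives

$$
\delta S[z(t)] = \int_0^T \left(\tfrac{1}{2}\, z\cdot J\cdot\dot{\zeta} + \tfrac{1}{2}\,\zeta\cdot J\cdot\dot{z} - \zeta\cdot\nabla H(z)\right) dt. \qquad (10.5)
$$

Use integration by parts to move the time derivative from $\zeta$ to $z$:

$$
\delta S[z(t)] = \int_0^T \left(-\tfrac{1}{2}\,\dot{z}\cdot J\cdot\zeta + \tfrac{1}{2}\,\zeta\cdot J\cdot\dot{z} - \zeta\cdot\nabla H(z)\right) dt \qquad (10.6)
$$

$$
= \int_0^T \left(\zeta\cdot J\cdot\dot{z} - \zeta\cdot\nabla H(z)\right) dt. \qquad (10.7)
$$

For this to equal zero for any variation, we must have

$$
J\cdot\dot{z} - \nabla H(z) = 0, \qquad (10.8)
$$

which can be rearranged to obtain Hamilton's equations:

$$
\dot{q} = \frac{\partial H}{\partial p}, \qquad (10.9)
$$

$$
\dot{p} = -\frac{\partial H}{\partial q}. \qquad (10.10)
$$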

These equations can be solved to find the rays, which are then used to construct 
approximate solutions. The function $H(q,p)$ here is the symbol of the hamiltonian 
operator. When the wave equation to be solved is a vector wave equation, instead 
of a scalar equation, the symbol of the dispersion operator D is no longer a scalar 
function, and cannot be directly used to generate rays. 
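
As a minimal sketch of how rays are obtained in the scalar case, the following Python fragment integrates Hamilton's equations with a fixed-step Runge-Kutta scheme and checks that the value of the ray hamiltonian stays constant along the ray. The names `trace_ray` and `oscillator_hamiltonian`, the finite-difference gradient, and the harmonic-oscillator test symbol are illustrative assumptions, not part of the theory described above.

```python
import math

def trace_ray(H, q0, p0, t_end, steps=2000, eps=1e-6):
    """Integrate dq/dt = dH/dp, dp/dt = -dH/dq with a fixed-step RK4 scheme."""
    def rhs(q, p):
        # Central differences stand in for the analytic gradient of H.
        dH_dq = (H(q + eps, p) - H(q - eps, p)) / (2.0 * eps)
        dH_dp = (H(q, p + eps) - H(q, p - eps)) / (2.0 * eps)
        return dH_dp, -dH_dq

    dt = t_end / steps
    q, p = q0, p0
    ray = [(q, p)]
    for _ in range(steps):
        k1q, k1p = rhs(q, p)
        k2q, k2p = rhs(q + 0.5 * dt * k1q, p + 0.5 * dt * k1p)
        k3q, k3p = rhs(q + 0.5 * dt * k2q, p + 0.5 * dt * k2p)
        k4q, k4p = rhs(q + dt * k3q, p + dt * k3p)
        q += dt * (k1q + 2.0 * k2q + 2.0 * k3q + k4q) / 6.0
        p += dt * (k1p + 2.0 * k2p + 2.0 * k3p + k4p) / 6.0
        ray.append((q, p))
    return ray

def oscillator_hamiltonian(q, p):
    """Hypothetical test symbol H(q, p) = (p^2 + q^2) / 2."""
    return 0.5 * (p * p + q * q)

# Trace one full period of the test oscillator, starting at (q, p) = (1, 0).
ray = trace_ray(oscillator_hamiltonian, q0=1.0, p0=0.0, t_end=2.0 * math.pi)
q_end, p_end = ray[-1]

# The ray hamiltonian should be (nearly) conserved along the ray.
drift = abs(oscillator_hamiltonian(q_end, p_end) - oscillator_hamiltonian(1.0, 0.0))
```

For a dispersion function in place of the test symbol, the same conservation property is what keeps a ray on the dispersion surface once it starts there.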

The usual way to get around this is to use one of the eigenvalues of the dispersion 
matrix as the ray hamiltonian. Write the eigenvalues as functions on phase space: 
$D_\alpha(x,k)$, $\alpha = 1,2$. From Equation (10.1), we can see that in order for a solution 
to be non-zero, it must be a zero eigenvector of the dispersion matrix. Using an 
eigenvalue as the ray hamiltonian ensures that the numerical value of $D_\alpha$ will 
remain constant along a ray. So if the ray starts with initial conditions $(x_0, k_0)$ such 
that $D_\alpha(x_0, k_0) = 0$, then $D_\alpha$ will remain zero along the ray. The WKB method can 
then be used to construct an approximate solution by tracing out a family of such 
rays. 

10.2 Diagonals as ray Hamiltonians for vector problems



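As described above, the derivation of the ray-tracing equations works fine 
for problems with scalar Hamiltonian operators, such as we have in the scalar 
Schrödinger equation. For vector problems, we need to exponentiate the matrix 
of operators, $\hat{\mathbf{D}}$. In the derivation of the path integral from the theory of symbols, 
we considered the limit expression for the exponential,

$$
e^{it\hat{A}} = \lim_{N\to\infty}\left(1 + \frac{it}{N}\hat{A}\right)^{N}. \qquad (10.11)
$$

For the matrix of operators $\hat{\mathbf{D}}$, we might consider a similar expression,

$$
e^{it\hat{\mathbf{D}}} = \lim_{N\to\infty}\left(\mathbf{1} + \frac{it}{N}\hat{\mathbf{D}}\right)^{N}, \qquad (10.12)
$$

where $\mathbf{1}$ is the $2\times 2$ identity matrix. It is this formula which suggests a possible 
new approach for a ray-tracing algorithm. 

Consider the symbol of this expression for the case where $N = 2$. This can be 
written using the star product: 

$$
\left(\mathbf{1} + \frac{it}{2}\mathbf{D}\right) * \left(\mathbf{1} + \frac{it}{2}\mathbf{D}\right). \qquad (10.13)
$$

This star product will look like the ordinary matrix product, plus terms involving
Moyal corrections whose leading terms involve the Poisson bracket of the elements
of the matrix. If the diagonal elements commute with the off-diagonals 
(e.g., $\{D_{11}, D_{12}\} = 0$), then the Moyal correction terms get pushed to one higher 
order. For example, the upper right off-diagonal term of this product, with Moyal 
corrections, can be written (using $\epsilon$ as an ordering parameter) as: 

$$
\left[\left(\mathbf{1} + \frac{it}{2}\mathbf{D}\right) * \left(\mathbf{1} + \frac{it}{2}\mathbf{D}\right)\right]_{12}
= \frac{it}{2}D_{12} + \frac{it}{2}D_{12} + \left(\frac{it}{2}\right)^{2} D_{11} * D_{12} + \left(\frac{it}{2}\right)^{2} D_{12} * D_{22} \qquad (10.14)
$$

$$
= it\,D_{12} + \left(\frac{it}{2}\right)^{2}\left[D_{11}D_{12} + D_{12}D_{22} + \epsilon\left(\{D_{11}, D_{12}\} + \{D_{12}, D_{22}\}\right) + O(\epsilon^{2})\right] \qquad (10.15)
$$

$$
= it\,D_{12} + \left(\frac{it}{2}\right)^{2}\left[D_{11}D_{12} + D_{12}D_{22} + O(\epsilon^{2})\right]. \qquad (10.16)
$$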
Additionally, as we know from the work on mode conversion problems, a nice 
polarization to choose is one where the diagonal terms represent the "uncoupled" 
wave modes, and the off-diagonal terms represent a locally constant coupling. 

These observations suggest that it might be useful to choose the local polarization
basis along the ray to be one where the diagonal elements commute with the 
off-diagonal elements. In terms of the symbols of the matrix elements, this means 
that the symbols of the diagonals would Poisson commute with the symbols of the 
off-diagonals: 



$$
\{D_{11}(x,k), D_{12}(x,k)\} = \{D_{11}(x,k), D_{21}(x,k)\} = 0 \qquad (10.17)
$$

and 

$$
\{D_{22}(x,k), D_{12}(x,k)\} = \{D_{22}(x,k), D_{21}(x,k)\} = 0. \qquad (10.18)
$$

This defines a type of "normal form" for the dispersion matrix [34]. In this normal 
form, we want to interpret the diagonals as uncoupled modes, so we will propose that 
the diagonals be used as the ray hamiltonians for the rays of the two different modes. 
We would then update the polarization along the ray so as to keep the matrix in 
normal form. With the diagonals as the ray hamiltonians, the off-diagonals would 
remain constant along the ray: 

$$
\frac{d}{d\sigma} D_{12} = \{D_{11}, D_{12}\} = 0. \qquad (10.19)
$$

FIG. 10.1: A simple calculation which motivates the normal form approach. The dispersion
surface on the left is calculated using the determinant of the dispersion matrix, while 
the surfaces on the right are calculated using the diagonals of the matrix, in "normal 
form". The coupling causes the surface on the left to reconnect in complicated ways, 
while mode conversion regions in the figure on the right can be quickly identified by the 
intersections of the two surfaces. The arrow indicates a region where the intersection is 
non-transverse, and therefore not solvable using standard mode conversion theory. 



This proposed normal form differs from the approach of Emmrich and Weinstein 
[35], where the full Moyal corrections are treated order by order, but the expansion 
is taken about a fixed (but arbitrary) point in phase space. 

This normal form approach may be especially useful for solving mode conversion 
problems. Instead of the rays forming an avoided crossing as they do when using the 
eigenvalues as ray hamiltonians, we would instead find the mode conversion point 
where the two dispersion surfaces intersect. A mode conversion point (or mode 
conversion surface in higher dimensions) is then a solution to 



$$
D_{11}^{NF}(x,k) = D_{22}^{NF}(x,k) = 0, \qquad (10.20)
$$

where the superscript is a reminder that the matrix must be put into normal form. 
Figure 10.1 shows an example which motivates this normal form approach. That
figure shows the dispersion surfaces for waves in a simple cold-plasma model 
of a tokamak [8]. This simple model can be put into something approximating 
normal form everywhere in phase space by a particular change of polarization basis. 
The dispersion surface calculated from the determinant of D(x, k) has a complicated 
structure (left), which is a result of the "avoided crossings" near mode conversions. 
However, the "crossings" are not avoided when the zeros of the diagonals are plotted, 
and the mode conversion regions can be identified by the intersection of the two 
surfaces. Note that there are regions where the intersections are not transverse 
(indicated by the arrow in the figure on the right), and hence the usual mode 
conversion theory will not work. 
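
The contrast between avoided crossings and intersecting diagonals can be illustrated with a toy calculation. The sketch below assumes a hypothetical real symmetric 2x2 dispersion matrix with diagonals `k - x` and `k + x` and a constant coupling `eta`; none of these choices come from the cold-plasma model of Figure 10.1. It shows that the eigenvalue branches never come closer than `2 * eta`, while the diagonals coincide at the conversion point.

```python
import math

def eigenvalues_2x2(d11, d22, eta):
    """Eigenvalues of the real symmetric matrix [[d11, eta], [eta, d22]]."""
    mean = 0.5 * (d11 + d22)
    half_gap = math.hypot(0.5 * (d11 - d22), eta)
    return mean - half_gap, mean + half_gap

def diagonals(x, k):
    """Hypothetical uncoupled modes whose zero sets cross at x = 0, k = 0."""
    return k - x, k + x

eta = 0.1                                   # assumed constant coupling
xs = [i / 100.0 for i in range(-100, 101)]  # scan x at fixed k = 0

eigen_gaps = []
diagonal_gaps = []
for x in xs:
    d11, d22 = diagonals(x, 0.0)
    low, high = eigenvalues_2x2(d11, d22, eta)
    eigen_gaps.append(high - low)
    diagonal_gaps.append(abs(d11 - d22))

min_eigen_gap = min(eigen_gaps)        # stays at 2 * eta: the crossing is avoided
min_diagonal_gap = min(diagonal_gaps)  # reaches zero: the diagonals intersect
```

In this toy model the point where `min_diagonal_gap` vanishes plays the role of the mode conversion point defined by Equation (10.20).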



10.3 "Double" symbol for vector wave problems 

There is a second approach to vector wave problems which is suggested by 
the theory of symbols as described in this work. This goes back to the basic idea 
underlying the Zobin theory of symbols, which says that the symbol of an operator 
is a double Fourier transform defined on an appropriate non-commutative group. 
The dispersion operator that we are working with has both a continuous part (for 
example, the usual symbols of each matrix entry are smooth functions on phase space) 
and a discrete part (the matrix structure itself). So according to the construction 
given in Chapter 7, we can compute the symbol of this operator-valued matrix if 
we have a group with an irreducible representation which is also an operator-valued 
matrix. This suggests that we should form a sort of "double" symbol. We can 
combine the continuous Heisenberg-Weyl group with the discrete Heisenberg-Weyl 
group (or any other group with finite matrix representations), and we will get the 
type of irreducible representations we need. 

as a set 

L2(M'"+' ® Z„, dgdN) L2(M2-'^r^Z„, dP) 

(10.21) 

The double symbol of D calculated using this approach will be a function on phase 
space (corresponding to the symbol of the differential part of D), with an additional 
discrete parameter (corresponding to the discrete symbol of the matrix part of D): 

$$
\hat{\mathbf{D}} \;\to\; D(x,k;j). \qquad (10.22)
$$

This can be thought of as a function on several copies of phase space, each of which is 
labeled by j. This new, double, symbol of the matrix operator D has an interesting 
mathematical structure, and its use in vector wave problems is an area of future 
research. 

10.4 Applications of the phase space picture: En- 
tropy, Mixed States, and Averaged Wigner 
Functions 

The problem of turbulent fluctuations in plasmas presents significant theoretical 
and experimental challenges. We describe here how the idea of the mixed state from 
quantum mechanics could possibly be used to model the uncertainty introduced 
by turbulence in a plasma. The Wigner function for the mixed state can look 
much more classical because of decoherence. Similarly, we might expect that the 
Wigner function for a pair of interacting wave modes would become more localized to 
their respective dispersion surfaces when the uncertainty introduced by turbulence 
is taken into account. 



10.4.1 Mode Conversion Model 

As in Chapter 3, we use a simple model of two coupled oscillators to describe 
the mode conversion process: 

$$
\ddot{x}_1 + \omega_1^2(t)\, x_1 + \eta\,(x_2 - x_1) = 0, \qquad (10.23)
$$

$$
\ddot{x}_2 + \omega_2^2(t)\, x_2 + \eta\,(x_1 - x_2) = 0. \qquad (10.24)
$$

When the time dependent natural frequencies of the two oscillators are nearly equal, 
they can exchange energy. The dispersion functions for the uncoupled modes are 
given by 

$$
D_\alpha(t,\omega) = \omega^2 - \omega_\alpha^2(t), \qquad \alpha = \{1,2\}, \qquad (10.25)
$$

and the dispersion curves for the uncoupled modes are defined as the solutions to the 
equations $D_\alpha = 0$. Here we assume these curves cross in the time-frequency plane at 
the point $(t_0, \omega_0)$. This is the mode conversion point, and the amount of conversion 
will depend on how long the oscillators stay in resonance. This will depend on 
how fast the dispersion function for one mode changes when following rays of the 
other mode; expressed using the Poisson bracket this becomes $B = |\{D_1, D_2\}|_{(t_0,\omega_0)}$. 
Geometrically, this quantity is related to the angle between the two dispersion curves 
where they cross at the mode conversion point. 
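
To make the model concrete, the following sketch integrates Equations (10.23) and (10.24) numerically with a fixed-step Runge-Kutta scheme. The linear frequency sweeps, the crossing time `t0 = 10`, the coupling value, and the helper names `simulate` and `energy2` are all assumptions chosen for illustration; they are not the parameters used for the figures in this chapter.

```python
def simulate(eta, t_end=20.0, dt=0.001, t0=10.0, rate=0.05):
    """RK4 integration of the two coupled oscillators; returns (x1, v1, x2, v2) at t_end."""
    def omega1(t):
        return 1.0 + rate * (t - t0)   # assumed linear frequency sweep

    def omega2(t):
        return 1.0 - rate * (t - t0)   # crosses omega1 at t = t0

    def deriv(t, s):
        x1, v1, x2, v2 = s
        a1 = -omega1(t) ** 2 * x1 - eta * (x2 - x1)
        a2 = -omega2(t) ** 2 * x2 - eta * (x1 - x2)
        return (v1, a1, v2, a2)

    def shifted(s, k, h):
        # State advanced by h along the slope k (used for the RK4 stages).
        return tuple(si + h * ki for si, ki in zip(s, k))

    state = (1.0, 0.0, 0.0, 0.0)       # all energy starts in oscillator 1
    n_steps = int(round(t_end / dt))
    for n in range(n_steps):
        t = n * dt
        k1 = deriv(t, state)
        k2 = deriv(t + 0.5 * dt, shifted(state, k1, 0.5 * dt))
        k3 = deriv(t + 0.5 * dt, shifted(state, k2, 0.5 * dt))
        k4 = deriv(t + dt, shifted(state, k3, dt))
        state = tuple(
            s + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)
        )
    return state

def energy2(state):
    """Energy-like measure of oscillator 2 (omega2 = 0.5 at t = 20 for the defaults)."""
    _, _, x2, v2 = state
    return v2 * v2 + (0.5 * x2) ** 2

uncoupled = simulate(eta=0.0)   # oscillator 2 is never excited
coupled = simulate(eta=0.1)     # some energy is transferred to oscillator 2
```

With the coupling switched off the second oscillator stays at rest, so any energy found in it in the coupled run is due to the interaction.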

As has been demonstrated using a variety of techniques 
and discussed in Chapter 3 of this dissertation, the mode conversion can be described 
as a scattering process, with incoming and outgoing wave amplitudes and phases. 
The transmission and conversion coefficients are 

$$
\tau = \exp\left(-\pi|\eta|^2/B\right), \qquad (10.26)
$$

$$
\beta = \frac{\cdots}{\eta\,\Gamma\!\left(-i|\eta|^2/B\right)}. \qquad (10.27)
$$

These predictions agree well with numerical solutions, as seen in Figure 10.2.

FIG. 10.2: Absolute value of $x_1$ and $x_2$, from numerical solution, superimposed on the 
WKB amplitude. Note that the jump in the WKB amplitude at the mode conversion 
point predicts the amount of mode conversion well. 
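
As a rough numerical illustration of Equation (10.26), the sketch below evaluates the Poisson bracket of the two uncoupled dispersion functions at the crossing point by finite differences and then computes the transmission coefficient. The sign convention of the bracket, the linear frequency profiles, and the value of `eta` are assumptions; in particular, `eta` is treated here as the coupling that enters (10.26), and the normalization relating it to the coupling in Equations (10.23) and (10.24) is fixed in Chapter 3 and not reproduced here.

```python
import math

def poisson_bracket(f, g, t, w, eps=1e-5):
    """{f, g} = df/dt * dg/dw - df/dw * dg/dt by central differences (convention assumed)."""
    f_t = (f(t + eps, w) - f(t - eps, w)) / (2.0 * eps)
    f_w = (f(t, w + eps) - f(t, w - eps)) / (2.0 * eps)
    g_t = (g(t + eps, w) - g(t - eps, w)) / (2.0 * eps)
    g_w = (g(t, w + eps) - g(t, w - eps)) / (2.0 * eps)
    return f_t * g_w - f_w * g_t

def omega1(t):
    return 1.0 + 0.05 * (t - 10.0)   # hypothetical frequency profile

def omega2(t):
    return 1.0 - 0.05 * (t - 10.0)   # crosses omega1 at t = 10

def D1(t, w):
    return w * w - omega1(t) ** 2    # uncoupled dispersion function, mode 1

def D2(t, w):
    return w * w - omega2(t) ** 2    # uncoupled dispersion function, mode 2

t0, w0 = 10.0, 1.0                   # crossing point of the two dispersion curves
B = abs(poisson_bracket(D1, D2, t0, w0))

eta = 0.1                            # assumed coupling entering (10.26)
tau = math.exp(-math.pi * abs(eta) ** 2 / B)
```

A larger bracket `B` (a steeper crossing) or a weaker coupling pushes `tau` toward one, meaning that less energy is converted.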



10.4.2 Mixed State Wigner Function 

In the density matrix formulation of quantum mechanics, a mixed state is one 
which is written as 

$$
\rho = \sum_j p_j\, |\psi_j\rangle\langle\psi_j|, \qquad (10.28)
$$

where the probabilities $0 < p_j < 1$ sum to one, but are otherwise unconstrained. The 
Wigner function for this state is simply the weighted sum of the Wigner functions 
for the constituent states: 

$$
W_j(q,p) = \int dq'\, e^{ipq'}\, \psi_j^*\!\left(q + \tfrac{q'}{2}\right) \psi_j\!\left(q - \tfrac{q'}{2}\right), \qquad (10.29)
$$

$$
W(q,p) = \sum_j p_j\, W_j(q,p). \qquad (10.30)
$$

This construction of a "mixed state" Wigner function can be thought of as a 
marginalization over some parameters in the system which are not well known. 



10.4.3 Coupled Oscillator Wigner Function 

We can calculate the Wigner function for the coupled oscillator model by using 
the time-frequency version of the Wigner function. Also, since the states in the 
model are vectors, the Wigner function becomes a matrix, with Greek indices 
labeling the modes: 

$$
W_{\alpha\beta}(t,\omega) = \int ds\, e^{i\omega s}\, x_\alpha^*\!\left(t + \tfrac{s}{2}\right) x_\beta\!\left(t - \tfrac{s}{2}\right). \qquad (10.31)
$$

The four components of this Wigner matrix are shown in Figure 10.3, along with 
the dispersion curves. Note the complicated interference patterns which extend well 
away from the dispersion curves. This is due to the fact that the Wigner function is 
non-local in time, and there is phase coherence for long times. Physically we expect 
turbulence to destroy this phase coherence. 

FIG. 10.3: Absolute value of the Wigner matrix and the dispersion curves for the coupled
oscillator model of mode conversion in a plasma. In order to isolate the positive 
frequencies, the numerical solution plotted here was calculated using a complex solution. 
The Wigner function for a real-valued solution would also have support for negative 
frequencies, as well as an interference pattern near zero frequency. 

10.4.4 Turbulent Uncertainties 

Here we model the effect of turbulence as an uncertainty in the parameters 
of the model. Specifically, we draw the mode conversion point from a Gaussian 
distribution centered at the original $(t_0, \omega_0)$. This uncertainty is introduced into the 
model by modifying the parameters of the dispersion functions so that the dispersion 
curves rigidly shift from their original location. Hence, the elements of the Wigner 
matrix also rigidly shift, and the resulting ensemble of functions can simply be 
summed with the appropriate Gaussian weights. The resulting Wigner matrix will 
be a "mixed state" function, and shows much better confinement to the dispersion 
curves. The averaging procedure effectively washes out the fine scale structure in 
the Wigner function, resulting in a more "classical" distribution on phase space (see 
Figure 10.4). There is a natural physical connection between such turbulence effects 
and the path integral approach described in this dissertation. This is an area for 
future investigation. 
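
The way Gaussian-weighted averaging over rigid shifts suppresses fine-scale structure can be seen in a one-dimensional toy calculation. In the sketch below a rapidly oscillating cosine stands in for an interference fringe and a broad Gaussian bump stands in for a smooth envelope; both functions, the fringe wavenumber `k`, the shift width `sigma`, and the helper `gaussian_average` are illustrative assumptions and not the Wigner matrix of the coupled oscillator model.

```python
import math

def gaussian_average(f, x, sigma, n=2001, width=6.0):
    """Average f(x - s) over shifts s with Gaussian weights of width sigma (grid sum)."""
    shifts = [(-width + 2.0 * width * i / (n - 1)) * sigma for i in range(n)]
    weights = [math.exp(-0.5 * (s / sigma) ** 2) for s in shifts]
    total = sum(weights)
    return sum(w * f(x - s) for w, s in zip(weights, shifts)) / total

k = 20.0        # assumed fringe wavenumber (fine-scale structure)
sigma = 0.2     # assumed width of the distribution of shifts

def fringe(x):
    return math.cos(k * x)      # stand-in for an interference pattern

def envelope(x):
    return math.exp(-x * x)     # stand-in for a smooth, "classical" feature

avg_fringe = gaussian_average(fringe, 0.0, sigma)
avg_envelope = gaussian_average(envelope, 0.0, sigma)

# Closed-form values of the same Gaussian averages, for comparison.
expected_fringe = math.exp(-0.5 * (k * sigma) ** 2)
expected_envelope = 1.0 / math.sqrt(1.0 + 2.0 * sigma ** 2)
```

The fringe is damped by the factor exp(-k^2 sigma^2 / 2), which is tiny when the shift width exceeds the fringe spacing, while the broad envelope is only slightly lowered. This is the same qualitative behavior seen when comparing Figure 10.3 with Figure 10.4.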

This calculation illustrates the potential that the mixed state Wigner function 
has for analyzing mode conversion in plasmas with turbulent fluctuations. There 
are several important aspects which remain to be considered, however. First, the 
physical interpretation of the mixed state calculation should be clarified in this 
context. Here the analogy with semi-classical quantum systems will probably be 
fruitful. Second, there is the issue of finding an equation of motion for the mixed 
state Wigner function. In many quantum problems this is not too hard, since the 
Hamiltonian is the same for each state in the ensemble. In our case, however, each of 
the "states" is a solution to a slightly different equation. Do these somehow average 
into a "master equation" for the mixed state Wigner function? 

FIG. 10.4: Absolute value of the Wigner matrix for the mixed state. This shows some 
broadening, but the function is now much better confined to the dispersion curves. The 
interference patterns seen in Figure 10.3 have mostly been averaged out. 



CHAPTER 11 



Summary and Conclusion 

11.1 Summary of Part I 

In the first part of this dissertation, we described the phase space theory of mode 
conversion. Chapter 2 gives an introduction to the phase space point of view for 
solving generic wave equations. The WKB method for construction of approximate 
solutions is reviewed, and connected to phase-space ray-tracing algorithms. Then, in 
Chapter 3, the example of two coupled oscillators is given. If the natural frequencies 
of the oscillators are time dependent, and cross at some time, then this problem can 
be recast into a form which is mathematically very similar to a mode conversion 
problem. The complete description of the solution of this coupled oscillator problem 
provides a pedagogical introduction to the phase space techniques used in the theory 
of mode conversion. 

In Chapter 4, we apply these tools to a standard avoided crossing mode conversion.
Usually, the solution of such a problem involves the linearization of the dispersion
function about the mode conversion point. A local solution is then constructed 
so that incoming and outgoing WKB solutions can be asymptotically matched, allowing
the problem to be treated as a sort of "ray splitting". We analyze the effects 
of the next order terms, and show how to construct a local solution which takes these 
quadratic terms into account. By including the effects of the quadratic order terms, 
the region in which the matching can be performed has been enlarged substantially. 

11.2 Summary of Part II 

The phase space theory described in Part I of this dissertation makes extensive 
use of the theory of symbols of operators. In Part II, the mathematical foundations 
of the theory of symbols are described. Because these mathematical foundations rely 
heavily on the theory of representations of groups, we first give a review of group 
theory in Chapter 7. The example of the Heisenberg-Weyl group is examined in 
detail, showing how the relationship between phase space and configuration space 
arises from the reduction of the regular representation of this group. 

Then, in Chapter 8, we describe how the symbol of an operator can be calculated
by a double Fourier transform. The operator is first embedded into a section 
of the dual bundle, which is like an operator-valued "function" on the set of irre- 
ducible representations of a non-commutative group. The non-commutative Fourier 
transform is then applied to convert this section into a function on the group, where 
the group is considered as a set. Then, using a commutative group structure on this 
same set, we perform another Fourier transform. The result is an ordinary complex- 
valued function, which is the Zobin symbol of the operator we started with. 

Using this definition of the symbol, as developed by Zobin, we proceeded to 
calculate the symbol of a function of an operator in Chapter 9. In particular, we 
considered the exponential of an operator, defined using a power series. The symbol 
of the $N$th power of an operator was computed by using the $N$th star product of 
the symbol of the operator. In the limit of large $N$, this repeated application of the 
star product can be written as a path integral, where the paths live in the dual to 
the commutative group. This general theory for path integrals was then illustrated 
by explicitly calculating the repeated star product for the discrete Heisenberg-Weyl 
group. Since there are irreducible matrix representations of this group, the cal- 
culation presented can be used to calculate functions of a matrix. In particular, 
the exponential of a matrix leads to a discrete "path integral", which by grouping 
similar paths can be written in terms of a multiplicity function. This led to the 
consideration of the connections between path integrals and statistical mechanics. 
In addition, the multiplicity function gives rise to a probability distribution, or mea- 
sure, on the space of all possible paths. This leads to an interpretation of the path 
integral as a Fourier transform on the space of measures, which for the continuous 
Heisenberg-Weyl group becomes an infinite-dimensional Fourier transform. Consid- 
ering the path integral for the Heisenberg-Weyl group also shows which aspect of 
group theory underlies the connection between the phase-space path integral and 
the configuration-space path integral. Specifically, the reduction of the regular rep- 
resentation to the primary representations leads to consideration of functions on 
phase space. This leads to the phase-space path integral. Further reduction of the 
primary representations to irreducible representations involves functions on configu- 
ration space. This reduces the phase-space path integral to the configuration-space 
path integral. 

The new group-theoretical approach to path integrals developed in Chapter 9 
has many potential applications. In Chapter 10 we outline several ways in which this 
new point of view could be applied to mode conversion theory. This chapter points 
out several avenues of current and future research. 

We first see how consideration of the star product formulation of the path inte- 
gral may lead to using the diagonals of the dispersion matrix as ray hamiltonians for 
constructing WKB solutions to vector wave problems. This requires the definition 
of a new "normal form" for the dispersion matrix, where the symbols of the diagonal
elements Poisson commute with the symbols of the off-diagonal elements. In 
addition to simplifying ray-tracing algorithms, this could also help provide physical 
insight for vector wave problems with non-standard mode conversion geometries. 

A second avenue of research based on our new group theory perspective involves 
calculating a "double" symbol for vector wave problems. The wave operator for 
vector wave problems can be written as a matrix of (pseudo) differential operators. 
The ordinary symbol of each element of the matrix can be calculated, giving the 
dispersion matrix as a function of phase space. However, as we saw in Chapter 
8, it is possible to calculate the "symbol" of a matrix. So we can calculate the 
discrete "symbol" of the dispersion matrix at each point in phase space. This gives 
a new "double symbol" of the wave operator, which is a function of several discrete 
variables in addition to being a function of the phase space variables. 

As a final potential topic of further research, we discuss in Chapter 10 the possibility
of using an averaged Wigner function to model the effects of turbulent plasma 
fluctuations on mode conversion. This approach is based on the connection between 
the Wigner function and the density matrix in quantum mechanics. In quantum 
mechanics, mixed state density matrices are used to model the decoherence of a 
quantum state due to interaction with the environment. This decoherence makes 
the Wigner function for a mixed state look like a more classical probability distri- 
bution on phase space. We suggest that a "mixed state" Wigner function could be 
used to describe mode-converting waves in a turbulent plasma. Preliminary calculations
suggest that this would make the Wigner function appear more "classical", 
with amplitude confined to regions near the dispersion curves for the various wave 
modes. 

In conclusion, this dissertation explores the depth and richness of the phase 
space perspective. Traditional asymptotics can refine solutions to mode conversion 
problems, as was done with the higher order corrections in Chapter 4. Additionally, 
because of the group theoretical foundations provided by the Zobin theory of symbols,
many new areas of research have been opened, with a wide range of potential 
applications. 



APPENDIX A 



Derivation of the Path Integral Formula 

The symbol of the exponential of an operator can be calculated from the symbol 
of an operator by making use of the properties of the Fourier transform for groups. 
The resulting formula is in the form of a path integral. In this appendix we give the 
details of the calculation leading up to the path integral. The resulting formula is 
defined for large $N$, and in the limit $N \to \infty$, this formula becomes a path integral 
(if a particular ordering is chosen). This calculation is due to Zobin, and we thank 
him for allowing us to reproduce it here. 

A.1 Definition of a Symbol 

We start with the definition of a symbol in terms of the Fourier transform for 
groups. 

$$
\tau \in \hat{G}_0 \qquad g \in G_0 = G \qquad \pi \in \hat{G} \qquad (A.1)
$$

$$
L^2(\hat{G}_0, d\hat{\mu}) \qquad L^2(G_0, d\mu) = L^2(G, d\mu) \qquad L^2(\hat{G}, dP) \qquad (A.2)
$$

A symbol: $S(\tau) \in L^2(\hat{G}_0, d\hat{\mu})$ (A.3) 
A section: $\Xi(\pi) = \left(\mathcal{F}_G \mathcal{F}_{G_0} S(\tau)\right)(\pi) = \Theta S \in L^2(\hat{G}, dP)$ (A.4) 

A.2 Symbol of the exponential of a section 

We are interested in functions of operators. In particular, we want the exponential
of an operator: $(\Theta^{-1} e^{it\Xi})(\tau) = \,?$ We will make use of the limit formula for 
the exponential, $e^{it\Xi} \approx \left(1 + \frac{it\Xi}{N}\right)^{N}$. 

At this point, we invoke the Plancherel theorem in $G_0$ in order to change the integral 
from one over the group to one over its dual $\hat{G}_0$. 

N / -J / ^-1 \ / \ \ \ N 



(A.12) 

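$$
= \int \Theta_N(\tau; \tau_1, \ldots, \tau_N) \prod_{j=1}^{N}\left(1 + \frac{it}{N} S(\tau_j)\right) \prod_{j=1}^{N} d\tau_j. \qquad (A.13)
$$

At this point, we apply the formula for the exponential again, to get that 

$$
\Theta^{-1}\left(1 + \frac{it\Xi}{N}\right)^{N}(\tau) \approx \int \Theta_N(\tau; \tau_1, \ldots, \tau_N) \exp\left[\frac{it}{N}\sum_{j=1}^{N} S(\tau_j)\right] \prod_{j=1}^{N} d\tau_j. \qquad (A.14)
$$

In the limit $N \to \infty$, the sum in the exponential can be interpreted as an integral. 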
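
The limit formula for the exponential used in this section can be checked numerically on a small matrix. The sketch below is only an illustration under stated assumptions: it uses the 2x2 matrix `sigma_x` as a stand-in operator, for which exp(i t sigma_x) = cos(t) 1 + i sin(t) sigma_x is known in closed form, and the helper names `matmul`, `limit_exponential`, and `max_error` are hypothetical.

```python
import math

def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def limit_exponential(A, t, N):
    """Approximate exp(i t A) by the N-fold product (1 + i t A / N)^N."""
    factor = [[(1.0 if i == j else 0.0) + 1j * t * A[i][j] / N for j in range(2)]
              for i in range(2)]
    result = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(N):
        result = matmul(result, factor)
    return result

sigma_x = [[0.0, 1.0], [1.0, 0.0]]   # stand-in operator with a known exponential
t = 1.0
exact = [[math.cos(t), 1j * math.sin(t)],
         [1j * math.sin(t), math.cos(t)]]

def max_error(N):
    """Largest entrywise deviation of the N-fold product from the exact exponential."""
    approx = limit_exponential(sigma_x, t, N)
    return max(abs(approx[i][j] - exact[i][j]) for i in range(2) for j in range(2))

error_coarse = max_error(10)
error_fine = max_error(1000)
```

The error shrinks roughly like 1/N, which is why the formula is only used in the limit of large N.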

A.3 Calculation of $\Theta_N(\tau; \tau_1, \ldots, \tau_N)$ 

By definition, we have 

$$
\Theta_N(\tau; \tau_1, \ldots, \tau_N) = \left(\mathcal{F}_{G_0^N}\, \tau(g_N \cdots g_1)\right)(\tau_1, \ldots, \tau_N) \qquad (A.15)
$$

$$
= \int_{G_0^N} \tau_1(g_1^{-1}) \cdots \tau_N(g_N^{-1})\, \tau(g_N \cdots g_1) \prod_{j=1}^{N} dg_j. \qquad (A.16)
$$

Now make a change of variables: 

$$
h_k = g_k \circ g_{k-1} \circ \cdots \circ g_1 \;\Rightarrow\; g_k = h_k \circ h_{k-1}^{-1} \quad \text{and} \quad g_k^{-1} = h_{k-1} \circ h_k^{-1}. \qquad (A.17)
$$

Putting this back into the equation above gives 

$$
\Theta_N(\tau; \tau_1, \ldots, \tau_N) = \int \tau(h_N) \prod_{k=1}^{N} \tau_k(h_{k-1} \circ h_k^{-1}) \prod_{k=1}^{N} dh_k \qquad (A.18)
$$

$$
= \int \tau(h_N) \prod_{k=1}^{N} \tau_k\!\left(h_{k-1} - h_k - \tfrac{1}{2}\omega(h_{k-1}, h_k)\right) \prod_{k=1}^{N} dh_k. \qquad (A.19)
$$

(Note: the $\tfrac{1}{2}\omega(h_{k-1}, h_k)$ term comes from the group product law, and depends on 
the choice of group. For example, if we are computing with the discrete HW group, 
then we do not have the factor of $\tfrac{1}{2}$.) We now introduce some new notation so that 
this integration becomes easier to perform. We will write the $\tau$'s in exponential 
form, and we will also separate the $2n$-dim component from the 1-dim component 
in both the $\tau$'s and the $h$'s. 



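$$
\tau_k(h) = e^{i\langle z_k, h\rangle}, \qquad (A.20)
$$

$$
z_k = (\tilde{z}_k, \hat{z}_k), \qquad (A.21)
$$

$$
h_k = (\tilde{h}_k, \hat{h}_k). \qquad (A.22)
$$

Here, the tilde refers to the $2n$-dim component and the hat refers to the 1-dim component. 

$$
\Theta_N(\tau; \tau_1, \ldots, \tau_N) = \int \tau(h_N) \exp\left[ i\sum_{k=1}^{N} \langle z_k, h_{k-1} - h_k\rangle - \frac{i}{2}\sum_{k=1}^{N} \hat{z}_k\, \omega(\tilde{h}_{k-1}, \tilde{h}_k) \right] \prod_{k=1}^{N} dh_k \qquad (A.23)
$$

$$
= \int \tau(h_N) \exp\left[ i\sum_{k=1}^{N} \langle z_{k+1} - z_k, h_k\rangle - \frac{i}{2}\sum_{k=1}^{N} \hat{z}_k\, \omega(\tilde{h}_{k-1}, \tilde{h}_k) \right] \prod_{k=1}^{N} dh_k \qquad (A.24)
$$

$$
= \int \tilde{\tau}(\tilde{h}_N)\, \hat{\tau}(\hat{h}_N) \exp\left[ i\sum_{k=1}^{N} \left( \langle \tilde{z}_{k+1} - \tilde{z}_k, \tilde{h}_k\rangle + (\hat{z}_{k+1} - \hat{z}_k)\hat{h}_k \right) - \frac{i}{2}\sum_{k=1}^{N} \hat{z}_k\, \omega(\tilde{h}_{k-1}, \tilde{h}_k) \right] \prod_{k=1}^{N} d\tilde{h}_k\, d\hat{h}_k. \qquad (A.25)
$$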



Now evaluate the integrals over $\hat{h}_k$. 

(A.26)
(A.27)
(A.28)
(A.29)

Now insert this back into the equation for $\Theta_N$. 

$$
\Theta_N(\tau; \tau_1, \ldots, \tau_N) = \prod_{k=1}^{N} \delta(\hat{z} - \hat{z}_k) \int \tilde{\tau}(\tilde{h}_N) \exp\left[ i\sum_{k=1}^{N} \langle \tilde{z}_{k+1} - \tilde{z}_k, \tilde{h}_k\rangle - \frac{i\hat{z}}{2}\sum_{k=1}^{N} \omega(\tilde{h}_{k-1}, \tilde{h}_k) \right] \prod_{k=1}^{N} d\tilde{h}_k \qquad (A.30)
$$

$$
= \prod_{k=1}^{N} \delta(\hat{z} - \hat{z}_k) \int \exp\left[ -i\left( \sum_{k=1}^{N} \langle \tilde{z}_k - \tilde{z}_{k+1}, \tilde{h}_k\rangle - \langle \tilde{z}, \tilde{h}_N\rangle \right) + \frac{i\hat{z}}{2}\sum_{k=1}^{N} \omega(\tilde{h}_k, \tilde{h}_{k-1}) \right] \prod_{k=1}^{N} d\tilde{h}_k \qquad (A.31)
$$

$$
(\tilde{z}_1 - \tilde{z}_2,\; \tilde{z}_2 - \tilde{z}_3,\; \ldots,\; \tilde{z}_{N-1} - \tilde{z}_N,\; \tilde{z}_N - \tilde{z}) \qquad (A.32)
$$

This integral is in the form of a Fourier transform of a multidimensional Gaussian. 
This can be seen by organizing the variables into vectors: 



exp — ^ u{hk, hk^i] 



k=2 



f^z\ 



\zn) 



( z \ 

Z2 



Zn 



\hN j 



(A.33)

(The $z$ in the final entry takes care of the $\tau(h_N)$ term.) We will also need the matrix given by 
the two-form $\omega$: 



W 



In matrix form, this looks like 



W 



iuj 
-ioj 



v 



iuj 
-iuj 



(A.35) 



The equation for $\Theta_N$ becomes 



exp {{W~'^{z'' - z), - ^}) 
^ydet{W) 



(A.36) 

±exp(-(W^(F-z),F-^). 

(A.37) 



A.4 Putting it back together 

Putting everything back together we get a formula for the symbol of the expo- 
nential of a section: 

Q^' (l + f ) (r) = ± / e-^^^^'-^-'-^ exp |j 5(r,) j n ^^^^ (^-38) 



N+l 

%Z 



it 



N 



N 



± / exp I -y ^ ^(4+1 - Zk, Zk - Zk-l) + — ^S{Tj) J JJ dT, 
k=2 j=l J j=l 



(A.39) 

Notice that the expression in the exponential is a function of the $\tau_j$'s and $\tau$, which 
is written in terms of the matrix $W$ as defined in the last section. 